OM nucleic - nucleic search, using sw model

Run on:          February 7, 2002, 11:03:09 ; Search time 3842.15 Seconds
                 (without alignments)
                 1807.663 Million cell updates/sec

Title:           US-09-394-745-6489
Perfect score:   421
Sequence:        1 ggggccagcacgcgtccggc ctcgacaagatctaaggggg 421

Scoring table:   IDENTITY_NUC
                 Gapop 10.0 , Gapext 1.0

Searched:        1472140 seqs, 8248589755 residues

Total number of hits satisfying chosen parameters:    2944280

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database :       GenEmbl:*
                 1: gb_ba:*
                 2: gb_htg:*
                 3: gb_in:*
                 4: gb_om:*
                 5: gb_ov:*
                 6: gb_pat:*
                 7: gb_ph:*
                 8: gb_pl:*
                 gb_pr:*
                 gb_ro:*
                 gb_sts:*
                 gb_sy:*
                 gb_un:*
                 gb_vi:*
                 em_ba:*
                 em_fun:*
                 em_hum:*
                 em_in:*
                 em_om:*
                 em_or:*
                 em_ov:*
                 em_pat:*
                 em_ph:*
                 em_pl:*
                 em_ro:*
                 em_sts:*
                 em_sy:*
                 em_vi:*
                 em_htgo_inv:*
                 em_htgo_rod:*
                 em_htg_hum:*
                 em_htg_inv:*
                 em_htg_rod:*
                 36: em_htg_other:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
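
The other summary figures in this report can be cross-checked by simple arithmetic. The sketch below is not taken from the search program: the formulas are inferred from the printed numbers, the function names are illustrative, and the factor of two for strands is an assumption suggested by the hit count (2944280) being twice the number of database sequences (1472140). Pred. No. is not reproduced, since it depends on the full score distribution.

```python
"""Cross-check of summary statistics printed in this search report.

Assumption: the formulas were inferred from the printed values and are
not documented by the report itself. All names here are illustrative.
"""


def query_match_pct(score, perfect_score):
    """Alignment score as a percentage of the query's perfect score."""
    return round(100.0 * score / perfect_score, 1)


def best_local_similarity_pct(matches, mismatches, indels):
    """Identical positions as a percentage of all aligned columns."""
    return round(100.0 * matches / (matches + mismatches + indels), 1)


def million_cell_updates_per_sec(query_len, db_residues, seconds, strands=2):
    """Search throughput in millions of matrix cells per second.

    Assumes each query base is compared with each database residue once
    per searched strand; strands=2 is an assumption, not a stated fact.
    """
    return query_len * db_residues * strands / seconds / 1e6


if __name__ == "__main__":
    # RESULT 1: Score 212 against a perfect score of 421
    print(query_match_pct(212, 421))                      # 50.4
    # RESULT 1: Matches 310, Mismatches 81, Indels 10
    print(best_local_similarity_pct(310, 81, 10))         # 77.3
    # Header: 421-base query, 8248589755 residues, 3842.15 seconds
    print(round(million_cell_updates_per_sec(421, 8248589755, 3842.15), 3))  # 1807.663
```

Under these assumptions the computed values agree with the printed Query Match, Best Local Similarity, and cell-update rate, which is a quick way to spot digits damaged in transcription.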
RESULT 1
AP003626
LOCUS       AP003626   163055 bp    DNA             HTG       18-MAY-2001
DEFINITION  Oryza sativa chromosome 6 clone P0429G06, *** SEQUENCING IN
            PROGRESS ***, in ordered pieces.
ACCESSION   AP003626
VERSION     AP003626.1  GI:14141687
KEYWORDS    HTG; HTGS_PHASE2.
SOURCE      Oryza sativa (cultivar:Nipponbare) DNA, clone:P0429G06.
  ORGANISM  Oryza sativa
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae;
            Ehrhartoideae; Oryzeae; Oryza.
REFERENCE   1  (sites)
  AUTHORS   Sasaki,T., Matsumoto,T. and Yamamoto,K.
  TITLE     Oryza sativa nipponbare (GA3) genomic DNA, chromosome 6, PAC
            clone:P0429G06
  JOURNAL   Published Only in DataBase (2001) In press
REFERENCE   2  (bases 1 to 163055)
  AUTHORS   Sasaki,T., Matsumoto,T. and Yamamoto,K.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2001) Takuji Sasaki, National Institute of
            Agrobiological Resources, Rice Genome Research Program; Kannondai
            2-1-2, Tsukuba, Ibaraki 305-8602, Japan
            (E-mail:tsasaki@abr.affrc.go.jp, URL:http://rgp.dna.affrc.go.jp/,
            Tel:81-298-38-7441, Fax:81-298-38-7468)
COMMENT     NOTE: It currently consists of 1 contigs. Gaps between the contigs
            are represented as runs of N. The order of the pieces is believed
            to be correct as given, however the sizes of the gaps between them
            are based on estimates that have provided by the submitter. This
            sequence will be replaced by the finished sequence as soon as it is
            available and the accession number will be preserved.
            * NOTE: This is a 'working draft' sequence.
            * This sequence will be replaced
            * by the finished sequence as soon as it is available and
            * the accession number will be preserved.
FEATURES             Location/Qualifiers
     source          1..163055
                     /organism="Oryza sativa"
                     /cultivar="Nipponbare"
                     /db_xref="taxon:4530"
                     /chromosome="6"
                     /clone="P0429G06"
BASE COUNT    47965 a  33832 c  33488 g  47470 t    300 others
ORIGIN


Query Match 50.4%; Score 212; DB 2; Length 163055; 

Best Local Similarity 77.3%; Pred. No. 1.1e-38; 

Matches 310; Conservative 0; Mismatches 81; Indels 10; Gaps 4; 



Qy 19 gcaggatcaggccaagctggctgcctcctctatcgacagcatcgtgaacggcagcgacgc 78 

I I I I I I I I II I I I M I I I I I I I I I I I II I I I I I I I I I I I I II I I II I II I 

Db 122931 GCAGGAGCAGGCCAAGCTGGCAGCCTCCTCTATCGACAGCATTGTCAATGGTGGTGATTC 122990 

Qy 79 cgtcatggagccggttgttgctggcagcaacacggtagtagctgttgcccaagttgagtt 138 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 

Db 122991 CGGGAAGGAACCAATTATTGCTG---GTCACGCTGAAGTAGCTGTTGCTCAAGTCGAGGT 123047 

Qy 139 gcaaacaatgaacgtgcagcagcccgctgatgttgccggacccagcgagggggtggcggc 198 

I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I III 
Db 123048 GAAGAC---GCTTGTTGCGCAGCCTGCTGAGATTGCTGGTCCTAGTGAGGGGGTCACGGT 123104 

Qy 199 gatctccaaaggggggaaggtanggccgaaccggtgcagcgcctgcaggaagagggttgg 258 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 123105 GAACCCCAA---GGGGAGGGAAGGACCAAATCGGTGCTCCACTTGTCGGAAGAGGGTTGG 123161 



Qy 259 acttacgggattcaactgccggtgtgggaacttgtactgcgcactccaccgctactccga 318 

I I I I I I I I I I I I I I I I I I I I I I I I I i I I I M I I I I I I I I I I I I I I I I I M I I I 
Db 123162 TCTTACCGGATTCAACTGCCGATGTGGCAACTTGTACTGCGCAATGCACCGCTATTCCGA 123221 



Qy 319 caagcacgactgcaagttcgactaacggactgctgccaaggacgccattgccaagggcta 378 

I I 1 I I 1 I I I I I I I I I II I I I I I II I I I I I I I III II I I I I I I II I I I I I 
Db 123222 TAAGCATGACTGCCAGTTTGACTATCGGACCGCTGCTAGGGATGCTATTGCCAA-GGCTA 123280 



Qy 379 atccggtggtgaaggcagacaagctcgacaagatctaaggg 419 

I I I I I I I I I I I I I I I I II II II I I I I I II II Ml 
Db 123281 ATCCGGTGGTGAAGGCGGAGAAACTTGACAAAATTTAGGGG 123321 



RESULT 2
AP003711
LOCUS       AP003711   165909 bp    DNA             HTG       31-MAY-2001
DEFINITION  Oryza sativa chromosome 6 clone P0417G12, *** SEQUENCING IN
            PROGRESS ***, in ordered pieces.
ACCESSION   AP003711
VERSION     AP003711.1  GI:14270111
KEYWORDS    HTG; HTGS_PHASE2.
SOURCE      Oryza sativa (cultivar:Nipponbare) DNA, clone:P0417G12.
  ORGANISM  Oryza sativa
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae;
            Ehrhartoideae; Oryzeae; Oryza.
REFERENCE   1  (sites)
  AUTHORS   Sasaki,T., Matsumoto,T. and Yamamoto,K.
  TITLE     Oryza sativa nipponbare (GA3) genomic DNA, chromosome 6, PAC
            clone:P0417G12
  JOURNAL   Published Only in DataBase (2001) In press
REFERENCE   2  (bases 1 to 165909)
  AUTHORS   Sasaki,T., Matsumoto,T. and Yamamoto,K.
  TITLE     Direct Submission
  JOURNAL   Submitted (30-MAY-2001) Takuji Sasaki, National Institute of
            Agrobiological Resources, Rice Genome Research Program; Kannondai
            2-1-2, Tsukuba, Ibaraki 305-8602, Japan
            (E-mail:tsasaki@abr.affrc.go.jp, URL:http://rgp.dna.affrc.go.jp/,
            Tel:81-298-38-7441, Fax:81-298-38-7468)
COMMENT     NOTE: It currently consists of 1 contigs. Gaps between the contigs
            are represented as runs of N. The order of the pieces is believed
            to be correct as given, however the sizes of the gaps between them
            are based on estimates that have provided by the submitter. This
            sequence will be replaced by the finished sequence as soon as it is
            available and the accession number will be preserved.
            * NOTE: This is a 'working draft' sequence.
            * This sequence will be replaced
            * by the finished sequence as soon as it is available and
            * the accession number will be preserved.
FEATURES             Location/Qualifiers
     source          1..165909
                     /organism="Oryza sativa"
                     /cultivar="Nipponbare"
                     /db_xref="taxon:4530"
                     /chromosome="6"
                     /clone="P0417G12"
BASE COUNT    47417 a  34811 c  35629 g  47902 t    150 others
ORIGIN



Query Match 50.4%; Score 212; DB 2; Length 165909; 

Best Local Similarity 77.3%; Pred. No. 1.1e-38; 

Matches 310; Conservative 0; Mismatches 81; Indels 10; Gaps 4; 

Qy 19 gcaggatcaggccaagctggctgcctcctctatcgacagcatcgtgaacggcagcgacgc 78 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I M I I I I M I M I I M I II I 

Db 42994 GCAGGAGCAGGCCAAGCTGGCAGCCTCCTCTATCGACAGCATTGTCAATGGTGGTGATTC 43053 

Qy 79 cgtcatggagccggttgttgctggcagcaacacggtagtagctgttgcccaagttgagtt 138 

II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 43054 CGGGAAGGAACCAATTATTGCTG---GTCACGCTGAAGTAGCTGTTGCTCAAGTCGAGGT 43110 

Qy 139 gcaaacaatgaacgtgcagcagcccgctgatgttgccggacccagcgagggggtggcggc 198 

I I I I I II I I I I I I I I I I I M I I I I I I II II I I I I I I III 
Db 43111 GAAGAC---GCTTGTTGCGCAGCCTGCTGAGATTGCTGGTCCTAGTGAGGGGGTCACGGT 43167 

Qy 199 gatctccaaaggggggaaggtanggccgaaccggtgcagcgcctgcaggaagagggttgg 258 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M II I I I I I 
Db 43168 GAACCCCAA---GGGGAGGGAAGGACCAAATCGGTGCTCCACTTGTCGGAAGAGGGTTGG 43224 

Qy 259 acttacgggattcaactgccggtgtgggaacttgtactgcgcactccaccgctactccga 318 

I I I I I I I I I I I I I I I I I I I I I I I I. I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 43225 TCTTACCGGATTCAACTGCCGATGTGGCAACTTGTACTGCGCAATGCACCGCTATTCCGA 43284 

Qy 319 caagcacgactgcaagttcgactaacggactgctgccaaggacgccattgccaagggcta 378 

I I I I I I I I I I I I I I I I I I I I Mill I I I I I I III II I I I I I I I I I I I I I 
Db 43285 TAAGCATGACTGCCAGTTTGACTATCGGACCGCTGCTAGGGATGCTATTGCCAA-GGCTA 43343 

Qy 379 atccggtggtgaaggcagacaagctcgacaagatctaaggg 419 

I I I I I I I I I I I I I I I I II II II I I I I I II II III 
Db 43344 ATCCGGTGGTGAAGGCGGAGAAACTTGACAAAATTTAGGGG 43384 



RESULT 3 
AF139499 

LOCUS       AF139499     1031 bp    mRNA            PLN       10-JUN-1999
DEFINITION  Prunus armeniaca p85RF mRNA, complete cds.
ACCESSION   AF139499
VERSION     AF139499.1  GI:5031280
KEYWORDS    .
SOURCE      apricot.
  ORGANISM  Prunus armeniaca
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots;
            Rosidae; eurosids I; Rosales; Rosaceae; Prunus.
REFERENCE   1  (bases 1 to 1031)
  AUTHORS   Mbeguie-A-Mbeguie,D. and Fils-Lycaon,B.R.
  TITLE     Molecular cloning and nucleotide sequence of a putative protein
            from apricot (Prunus armeniaca var. Bergeron)
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 1031)
  AUTHORS   Mbeguie-A-Mbeguie,D. and Fils-Lycaon,B.R.
  TITLE     Direct Submission
  JOURNAL   Submitted (31-MAR-1999) INRA, Domaine de Duclos, Petit-Bourg 97170,
            France
FEATURES             Location/Qualifiers
     source          1..1031
                     /organism="Prunus armeniaca"
                     /variety="Bergeron"
                     /db_xref="taxon:36596"
                     /tissue_type="fruit mesocarp and endocarp"
                     /dev_stage="ripe"
     gene            1..1031
                     /gene="p85RF"
     CDS             113..634
                     /gene="p85RF"
                     /note="similar to Phaesolis vulgaris PVPR3 protein"
                     /codon_start=1
                     /product="unknown"
                     /protein_id="AAD38146.1"
                     /db_xref="GI:5031281"
                     /translation="MEHEETGCQPHPEGPILCVNNCGFFGSVATRNMCSKCHKDMMLK
                     EEQAKLAASSFGNIVNGTSNSNGNEPVVAAGVDVQAHLVEPKTISLQPSFSFGSGSGG
                     SGEAKPEGPKRCGTCNKRVGLTGFNCRCGHLFCAVHRYSDKHDCPYDYHTAARDVIAK
                     ANPVVKADKLEKI"
BASE COUNT      270 a    185 c    254 g    322 t
ORIGIN



Query Match 22.9%; Score 96.4; DB 8; Length 1031; 

Best Local Similarity 56.2%; Pred. No. 4.2e-12; 

Matches 223; Conservative 0; Mismatches 167; Indels 7; Gaps 2; 

Qy 21 aggatcaggccaagctggctgcctcctctatcgacagcatcgtgaacggcagcgacgccg 80 

I I I I I I I I I I I I II I I I I I I I I I II I III II II II I I 
Db 246 AGGAGCAGGCAAAGCTCGCTGCATCATCCTTTGGAAACATTGTCAATGGAACATCAAACA 305 

Qy 81 tcatggagccggttgttgctggcagcaacacggtagtagctgttgcccaagttgagttgc 140 

III I I I I I I I I I I I I I I I I I I I I 



Db 306 GCAATGGAAATGAACCTGTTGTTGCTGCTGGTGTGGATGTTCAAGCCCATCTGGTGGAGC 365 



Qy 141 aaacaatgaacgtgcagcagcccgctgatgttgccggacccagcgagggggtggcggcga 200 

I I I I I I I I I I I I I I I I I I I I I 

Db 366 CAAAAACTATCTCTTTACAACCATCCTTTTCCTTTGGTTCAGGTTCAGGAGGGAGTGGTG 425 

Qy 201 tctccaaaggggggaaggtanggccgaaccggtgcagcgcctgcaggaagagggttggac 260 

Ml I I I I II II II III I I I I I I I I I I I I I I I I I 

Db 426 AGGCAAA------GCCAGAAGGCCCAAAACGTTGCGGAACTTGCAACAAGAGAGTTGGAT 479 

Qy 261 ttacgggattcaactgccggtgtgggaacttgtactgcgcactccaccgctactccgaca 320 

Til II I I I I I II I II I I I I I II I I Ml III I II II II M I I I I 
Db 480 TAACAGGGTTCAATTGTCGGTGTGGTCACCTTTTCTGTGCAGTACATCGTTATTCAGACA 539 

Qy 321 agcacgactgcaagttcgactaacggactgctgccaaggacgccattgccaagggctaat 380 

I I I I I II II I I I I I I I I I I I I I I III I I I I I I I M I I 

Db 540 AACATGACTGCCCTTATGATTACCACACTGCTGCACGCGATGTGATTGCTAA-AGCCAAC 598 

Qy 381 ccggtggtgaaggcagacaagctcgacaagatctaag 417 

II II II I I I I I II I I I I I II II II I I I I I 

Db 599 CCTGTTGTAAAGGCTGATAAGCTTGAAAAAATCTAAG 635 



RESULT 4
AC006085
LOCUS       AC006085   105807 bp    DNA             PLN       17-MAY-1999
DEFINITION  Arabidopsis thaliana chromosome I BAC F11M15 genomic sequence,
            complete sequence.
ACCESSION   AC006085
VERSION     AC006085.1  GI:4165340
KEYWORDS    HTG.
SOURCE      thale cress.
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots;
            Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis.
REFERENCE   1  (bases 1 to 105807)
  AUTHORS   Federspiel,N.A., Palm,C.J., Conway,A.B., Conn,L., Hansen,N.F.,
            Altafi,H., Araujo,R., Huizar,L., Rowley,D., Buehler,E., Dunn,P.,
            Gonzalez,A., Kremenetskaia,I., Kim,C., Lenz,C., Li,J., Liu,S.,
            Luros,S., Schwartz,J., Shinn,P., Toriumi,M., Vysotskaia,V.S.,
            Walker,M., Yu,G., Ecker,J., Theologis,A. and Davis,R.W.
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 105807)
  AUTHORS   Federspiel,N.A., Palm,C.J., Conway,A.B., Conn,L., Hansen,N.F.,
            Altafi,H., Araujo,R., Huizar,L., Rowley,D., Buehler,E., Dunn,P.,
            Gonzalez,A., Kremenetskaia,I., Kim,C., Lenz,C., Li,J., Liu,S.,
            Luros,S., Schwartz,J., Shinn,P., Toriumi,M., Vyotskaia,V.,
            Walker,M., Yu,G., Ecker,J., Theologis,A. and Davis,R.W.
  TITLE     Direct Submission
  JOURNAL   Submitted (03-DEC-1998) DNA Sequencing and Technology Center,
            Stanford University, 855 California Avenue, Palo Alto, CA 94304,
            USA
REFERENCE   3  (bases 1 to 105807)
  AUTHORS   Federspiel,N.A., Palm,C.J., Conway,A.B., Conn,L., Hansen,N.F.,
            Altafi,H., Araujo,R., Huizar,L., Rowley,D., Buehler,E., Dunn,P.,
            Gonzalez,A., Kremenetskaia,I., Kim,C., Lenz,C., Li,J., Liu,S.,
            Luros,S., Schwartz,J., Shinn,P., Toriumi,M., Vyotskaia,V.,
            Walker,M., Yu,G., Ecker,J., Theologis,A. and Davis,R.W.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-JAN-1999) DNA Sequencing and Technology Center,
            Stanford University, 855 California Avenue, Palo Alto, CA 94304,
            USA
REFERENCE   4  (bases 1 to 105807)
  AUTHORS   Federspiel,N.A., Palm,C.J., Conway,A.B., Conn,L., Hansen,N.F.,
            Altafi,H., Araujo,R., Huizar,L., Rowley,D., Buehler,E., Dunn,P.,
            Gonzalez,A., Kremenetskaia,I., Kim,C., Lenz,C., Li,J., Liu,S.,
            Luros,S., Schwartz,J., Shinn,P., Toriumi,M., Vyotskaia,V.,
            Walker,M., Yu,G., Ecker,J., Theologis,A. and Davis,R.W.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-1999) DNA Sequencing and Technology Center,
            Stanford University, 855 California Avenue, Palo Alto, CA 94304,
            USA
COMMENT     On Jan 21, 1999 this sequence version replaced gi:4079615.
            BAC F11M15 is near molecular marker m280 on Chromosome I.
            e-mail for correspondence: arab@sequence.stanford.edu
            Genes with similarity to proteins in the databases are described as
            'putative', '-like' or 'similar to'. Genes that have EST
            similarity but no significant protein similarity are described as
            'unknown proteins'. Genes that are annotated based only on gene
            prediction software are described as 'hypothetical proteins'.
            The software programs used to predict genes include: Grail
            (Informatics Group, Oak Ridge National Laboratory,
            http://compbio.ornl.gov/section/index.html), GENSCAN (Chris Burge,
            http://gnomic.stanford.edu/~chris/GENSCANW.html), Fexa (V.Solovyev
            & A.Salamov, Sanger Centre, http://genomic.sanger.ac.uk/), and
            NetPlantGene (S.M. Hebsgaard, et al., CBS, Technical University of
            Denmark, http://www.cbs.dtu.dk/NetPlantGene.html).
FEATURES             Location/Qualifiers
     source          1..105807
                     /organism="Arabidopsis thaliana"
                     /db_xref="taxon:3702"
                     /chromosome="I"
                     /clone="F11M15"
     gene            complement(673..1881)
                     /gene="F11M15.1"
     CDS             complement(join(673..821,1272..1881))
                     /gene="F11M15.1"
                     /note="Hypothetical protein"
                     /codon_start=1
                     /protein_id="AAD30628.1"
                     /db_xref="GI:4836926"
                     /translation="MLFRSVHHIVARFSNSTSTPIHRFFYSPSLLRRRSSFNASLISR
                     CCSSVSDVDVARDAVVKIFSFSREPNVVQPWQTTEKEYSSSGIHKHLSFSLSRVISLL
                     DFQILKNRMVLFGFGISMLGFAISGRRILTNAHVVGDHLYLQVRKHGSPTKYKAEVKA
                     FRYGCDLAILGIDSEEFWEDINPLELGGIPFIGETVYALGYPRGVKKRRRVVKGEEVD
                     ELGCRRRKQKMKQKDHWAMNLIKAHNANGFRYLS"
     mRNA            join(2603..2871,3022..3136,3876..3994,4100..4352)
                     /gene="F11M15.2"
     gene            2603..4352
                     /gene="F11M15.2"
     CDS             join(2648..2871,3022..3136,3876..3994,4100..4151)
                     /gene="F11M15.2"
                     /note="Unknown protein; Location of ESTs 118N12T7,
                     gb|T43878 and 118N12XP, gb|AA395082"
                     /codon_start=1
                     /protein_id="AAD30629.1"
                     /db_xref="GI:4836927"
                     /translation="MQFFGGSEISPSPPVPTASGNNAHMMYVFNRNGVCLLYKEWNRP
                     LHTLNPQQDHKLMFGLLFSLKSLTAKMDPVNADKGNLGVPQLPGQGCSFHSFRTNTYK
                     LSFMETPSGIKIILVTHPKTGDLRESLKYIYSLYVEYVVKNPIYSPGSPIKSELFNTA
                     LDQYVRSIS"
                     complement(6216..7430)
                     /gene="F11M15.3"
                     complement(6216..7430)
                     /gene="F11M15.3"
                     /note="Similar to protein kinases"
                     /codon_start=1
                     /protein_id="AAD30630.1"
                     /db_xref="GI:4836928"
                     /translation="METRPSSSSSLSPATNLNLDRLKVLKLLGKGATGTVFLVHDSVS
                     DSSVSSPFALKLVDKSSASSLRRARWEIQILRRLSDDTNPNPFLPKLLASSESSEFIA
                     WALPYCSGGDLNVLRQRQNDGVFSSSVIKFYLAEIVCALDHLHTMGIAYRDLKPENIL
                     LQESGHVTLTDFDLSCSLNKPTRPEFYHLSDPEPDPNPESNLSHNKKSLRIFRQKKKK
                     TKSARVNPITRRRLSFSGGERSNSFVGTDEYISPEVIRGDGHDFAVDWWALGVLTYEM
                     MYGETPFKGRNKKETFRNVLVKEPEFAGKPSDLTDLIRRLLVKDPTKRFGFWRGAAEI
                     KEHAFFKGVRWELLTEVLRPPFIPLRDDGDLTGKVTEESGFGIKEYFEKLKTPPLPLP
                     HECSENNPFVDF"
                     15856..18162
                     /gene="F11M15.4"
                     join(15856..16360,16458..17126,17201..17328,17401.
                     17680..18162)
                     /gene="F11M15.4"
                     /note="Hypothetical protein"
                     /codon_start=1
                     /protein_id="AAD30631.1"
                     /db_xref="GI:4836929"
                     /translation="MTSLSTPSRLTLLRGAHQIPHQIKLISPGTKTSPESPPPGHTCA
                     FEIFFSECGLFFPLPDILIDLMHEFGIALPQLCPNVIRTVLCLLTLAEEDGFRLALSD
                     FLQLYAVKKSRTNNTFFLSPRKGFRVFEDFPEKDEHWRKSYFFFPVNELTYEEKSDLF
                     VSDWTTRAVKVTIPPVSRTFASHFSNFCRHDLGWEALTSDRIRASGKRLRTRPDLAIS
                     SPPFSPRIFCKAEMSGPSYRAEKKRERERADQEKARILSDRVVMSGTLEDSKLRAAEK
                     AKHLVTDNKVKSRQEPGKTPTSATKNSGAPTSEAGSLAKKPQSKDSATREAEKKRKKP
                     EDDSCPSPRSSCPRLEEKNTGAGKTKEDPKEVPPSLVVLSSQESEARRSEPRSTPPPA
                     PPSSFADIMRTLVAPGSAIAPFDEMKEVNKDNYLRFAGKLGKLILEFNSVFYSHEDQL
                     SDKDSELDNFRRSEDENAHTVERASRVLKRMKDVEPRIQELEVNNIDLMAKLEEKLRK
                     LEEEQAGKIDVARKEERKKVKARFHEFSSKYGNFFRQSEEVKTLKVRVAETRANRELL
                     EEIQKGEIPDIDVELESVRADEAKFVQRAAEQKTPRPDPVELTLLLADTPPEMTAENA
                     TQDEAIAINERGSNKGSTSEAGIVAMFPVDVEKESEQAE"
                     complement(18671..23596)
                     /gene="F11M15.5"
                     complement(join(18671..19513,19645..20293,20468..20606,
                     20726..21201,21421..22481,22680..23072,23270..23596))
                     /gene="F11M15.5"
                     /note="Hypothetical protein"
                     /codon_start=1
                     /protein_id="AAD30632.1"
                     /db_xref="GI:4836930"
                     /translation="MSNTDDQSSGGQASTLDNIKQLLQQLSEKTDRQQLAVTTLSDKF
                     VSFQERCNGQLAALAVNHSEIRSTHNAYAERVDSLQERHGLQVFEHDSTFSAPRAQAS
                     RIRRKLLNLQDQMWAMNAKVHQATTSAPEVEKVIEATRRTPFTSRISKLRIRDFRDFK
                     LPGYNGKGDPKEYLTFFQVIAGRVPLEPHEEDAGLCKLFSESLSGPALTWFTQLEEGS
                     IDNFKQLSTAFIKQYGYFIKSDDALHQATNWINAEEERAFLATKFNAAPKASPPATSK
                     KPSELRKPAAGTFSVEKEPLKTSPKASPSKPPFSPKGKNGALPSNKWVRDPNAYCEIH
                     KMNGHATKDYKALGRLLAAKFASGEIHDIDVTTIDLAQTAYDVLDFEISSPPNKKQKN
                     EPAPVTPKGPKKMVKVIMGGSKLFRDSVSAIKQHERKVVAPVAKRPKTNQIDCPEISF
                     SEEETSDLDKPHDDALVISLDVGNCEVQRVLIDTGSSVDLIFLDTLVRMGISKKDIKG
                     ALSPLVSFTSETSMLLRTITLPVTAQGIVKMIEFTVFDRPAAYNIILGTPWLYEMKAV
                     PSTYHQCVKFPTPTGVKQILGSQRTSRIVGEMTGISTEVISHELNVDPTFKPVKQKYR
                     KLGPDRAEAVNAEVVRLLEEGLIREVKYPEWLANPVVVKKKNGKWRVCVDFTDLNKAC
                     PKDCFPLPHIDRHVESTTGHEMLSFMDAFSAYNQILMNPDDQEKTSFITERGTYCSKI
                     SSAKQWRFISTTCEFLGYIVTERGIEANPRQINAFLAMSSPRTISEVRRLTGRVAALN
                     SAVSSVLVRVERSDERPIFYVSKSFSDAETRYPMMEKLALAVVTSARKLRPYFQSHPI
                     VVLTSLPLRAILHSPSQSGRLAKWAIELSEFDLEFRARTSLKSQVLADFLIELPLATA
                     ESDVPSQPWILHVDGASSKQGSGAGVRLKLPTGEVLEQSFRLAFNASNNEAEYESLLA
                     GLRLAIGVGVRALQAHCDSQLVASQYSGDYEAKDSRMEAYLDQETNFVTTRAMRRRLD
                     EKAGSPEPQQPDDGDDSPEFVPPVAEDEVPITDNPEVSNQIPHDWGADWREPIRNYIL
                     HGVLPPDKWAARKLKATNARFCIANDILYRRLISAPDAICIFGEQPRTVIKKIHDGTC
                     GNHSGGRSLAFKVKKYGYFWPTMMADCEAYARRCEQCQKHAPYILQPAELLTSVSAPY
                     PYMKWSMDIIGPLHVSTRGVRFLLVLTDYFSKWVKAATYSNITQVQVRAFIWKEIICR
                     HGLPYEIVTDNGPQFISKHFESLCVRRQPVLWV"
                     30507..32896
                     /gene="F11M15.6"
                     join(30507..30946,31035..31117,31386..31474,31572.
                     31770..31897,32160..32896)
                     /gene="F11M15.6"
                     /note="Hypothetical protein; protein contains ethylene
                     responsive element binding protein signature motif"
                     /codon_start=1
                     /protein_id="AAD30633.1"
                     /db_xref="GI:4836931"
                     /translation="MINPHGGGGEGGEVPKVADFLGVSKSGDHHTDHNLVPYNDIHQT
                     NASDYYFQTNSLLPTVVTCASNAPNNYELQESAHNLQSLTLSMGSTGAAAAEVATVKA
                     SPAETSADNSSSTTNTSGGAIVEATPRRTLETFGQRTSIYRGVTRHRWTGRYEAHLWD
                     NSCRREGQSRKGRQGGYDKEEKAARAYDLAALKYWGPSTTTNFPITNYEKEVEEMKNM
                     TRQEFVASIRRKSSGFSRGASMYRGVTRHHQHGRWQARIGRVAGNKDLYLGTFSTEEE
                     AAEAYDIAAIKFRGLNAVTNFEINRYDVKAILESNTLPIGGGAAKRLKEAQALESSRK
                     REEMIALGSNFHQYGAASGSSSVASSSRLQLQPYPLSIQQPFEHLHHHQPLLTLQNNN
                     DISQYHDSFSYIQTQLHLHQQQTNNYLQSSSHTSQLYNAYLQSNPGLLHGFVSDNNNT
                     SGFLGNNGIGIGSSSTVGSSAEEEFPAVKVDYDMPPSGGATGYGGWNSGESAQGSNPG
                     GVFTMWNE"
                     38281..38802
                     /gene="F11M15.7"
                     38281..38802
                     /gene="F11M15.7"
                     /note="Unknown protein; Location of ESTs 136P4T7,
                     gb|T45618 and 136P4XP, gb|AA394606"
                     /codon_start=1
                     /protein_id="AAD30634.1"
                     /db_xref="GI:4836932"
                     /translation="MDHDKTGCQSPPEGPKLCTNNCGFFGSAATMNMCSKCHKDMLFQ
                     QEQGAKFASAVSGTSSSSNIIKETFTAALVDIETKSVEPMTVSVQPSSVQVVAEVVAP
                     EEAAKPKGPSRCTTCNKRVGLTGFKCRCGSLFCGTHRYADVHDCSFNYHAAAQEAIAK
                     ANPVVKAEKLDKI"
                     40400..41701


Query Match 21.8%; Score 91.8; DB 8; Length 105807; 

Best Local Similarity 70.3%; Pred. No. 2.5e-11; 

Matches 137; Conservative 0; Mismatches 57; Indels 1; Gaps 1; 



Qy 222 ggccgaaccggtgcagcgcctgcaggaagagggttggacttacgggattcaactgccggt 281 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I 
Db 38609 GACCAAGCCGATGTACTACTTGCAATAAGCGGGTTGGCCTGACTGGATTCAAATGTCGCT 38668 

Qy 282 gtgggaacttgtactgcgcactccaccgctactccgacaagcacgactgcaagttcgact 341 

I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 38669 GTGGGAGCCTCTTCTGCGGAACACACCGCTATGCAGACGTACATGACTGCTCCTTCAATT 38728 

Qy 342 aacggactgctgccaaggacgccattgccaagggctaatccggtggtgaaggcagacaag 401 

II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 

Db 38729 ACCATGCTGCTGCGCAAGAGGCGATAGCTAA-GGCAAACCCGGTTGTGAAAGCAGAGAAG 38787 

Qy 402 ctcgacaagatctaa 416 

II I I I I I I I I I I 
Db 38788 CTTGACAAAATCTGA 38802 



RESULT 5
AF140722
LOCUS       AF140722      844 bp    mRNA            PLN       07-JUN-2000
DEFINITION  Oryza sativa pathogenesis-related protein mRNA, complete cds.
ACCESSION   AF140722
VERSION     AF140722.1  GI:8307827
KEYWORDS    .
SOURCE      Oryza sativa.
  ORGANISM  Oryza sativa
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae;
            Ehrhartoideae; Oryzeae; Oryza.
REFERENCE   1  (bases 1 to 844)
  AUTHORS   Mukhopadhyay,A. and Tyagi,A.K.
  TITLE     Isolation and characterization of a novel pathogenesis-related
            protein gene (OSPR) from rice (Oryza sativa cv. Pusa Basmati 1)
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 844)
  AUTHORS   Mukhopadhyay,A. and Tyagi,A.K.
  TITLE     Direct Submission
  JOURNAL   Submitted (06-APR-1999) Department of Plant Molecular Biology,
            University of Delhi, South Campus, Benito Juarez Road, New Delhi
            110021, India
FEATURES             Location/Qualifiers
     source          1..844
                     /organism="Oryza sativa"
                     /cultivar="Pusa Basmati 1"
                     /db_xref="taxon:4530"
     CDS             110..604
                     /note="OSPR"
                     /codon_start=1
                     /product="pathogenesis-related protein"
                     /protein_id="AAF74344.1"
                     /db_xref="GI:8307828"
                     /translation="MAQRDKKDQEPTELRAPEITLCANSCGFPGNPATQNLCQNCFLA
                     ATASTSSPSSLSSPVLDKQPPRPAAPLVEPQAPLPPPVEEMASALATAPAPVAKTSAV
                     NRCSRCRKRVGLTGFRCRCGHLFCGEHRYSDRHGCSYDYKSAARDAIARDNPVVRAAK
                     IVRF"
BASE COUNT      206 a    225 c    249 g    164 t
ORIGIN



Query Match 21.5%; Score 90.4; DB 8; Length 844; 

Best Local Similarity 64.4%; Pred. No. 1e-10; 

Matches 150; Conservative 0; Mismatches 82; Indels 1; Gaps 1; 

Qy 188 ggggtggcggcgatctccaaaggggggaaggtanggccgaaccggtgcagcgcctgcagg 247 

II I I I I I I I I III I I I I I I II I I I I I I I I I 

Db 377 GCGACGGCGCCGGCGCCGGTCGCCAAGACGTCGGCGGTGAACCGGTGCTCCAGGTGCCGG 436 

Qy 248 aagagggttggacttacgggattcaactgccggtgtgggaacttgtactgcgcactccac 307 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I II III 
Db 437 AAGCGTGTCGGCCTCACCGGGTTCCGGTGCCGGTGCGGCCACCTGTTCTGCGGCGAACAC 496 

Qy 308 cgctactccgacaagcacgactgcaagttcgactaacggactgctgccaaggacgccatt 367 

I I I I I II II II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 497 CGGTACTCCGACCGCCACGGCTGCAGCTACGACTACAAGTCGGCGGCAAGGGACGCCATC 556 

Qy 368 gccaagggctaatccggtggtgaaggcagacaagctcgacaagatctaagggg 420 

II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 557 GCC-AGGGACAACCCGGTGGTGCGCGCGGCCAAGATCGTTAGGTTCTGAGAGG 608 



RESULT 6
AC090871/C
LOCUS       AC090871   165873 bp    DNA             HTG       28-JUN-2001
DEFINITION  Oryza sativa chromosome 3 clone OSJNBb0060J21, *** SEQUENCING IN
            PROGRESS ***, 5 unordered pieces.
ACCESSION   AC090871
VERSION     AC090871.7  GI:14573697
KEYWORDS    HTG; HTGS_PHASE1.
SOURCE      Oryza sativa.
  ORGANISM  Oryza sativa
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae;
            Ehrhartoideae; Oryzeae; Oryza.
REFERENCE   1  (bases 1 to 165873)
  AUTHORS   Buell,R., Hsiao,J., Zismann,V., Moffat,K.M., Hill,J.,
            Gansberger,K., Burgess,S., Jarrahi,B., Shvartsbeyn,M., Brenner,M.,
            Ciecko,A., Pai,G., Vanaken,S., Hansen,C., Utterbach,T.,
            Feldblyum,T., Khalak,H.G., Yuan,Q., Quackenbush,J., White,O.,
            Salzberg,S. and Fraser,C.
  TITLE     Oryza sativa ssp. japonica cv. Nipponbare OSJNBb0060J21 BAC genomic
            sequence
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 165873)
  AUTHORS   Buell,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (13-MAR-2001) The Institute for Genomic Research, 9712
            Medical Center Dr, Rockville, MD 20850, USA
COMMENT     On Jun 28, 2001 this sequence version replaced gi:14333942.
            * NOTE: This is a 'working draft' sequence. It currently
            * consists of 5 contigs. The true order of the pieces
            * is not known and their order in this sequence record is
            * arbitrary. Gaps between the contigs are represented as
            * runs of N, but the exact sizes of the gaps are unknown.
            * This record will be updated with the finished sequence
            * as soon as it is available and the accession number will
            * be preserved.
            *        1    11885: contig of 11885 bp in length
            *    11886    11923: gap of unknown length
            *    11924    28536: contig of 16613 bp in length
            *    28537    28574: gap of unknown length
            *    28575    95942: contig of 67368 bp in length
            *    95943    95979: gap of unknown length
            *    95980   122498: contig of 26519 bp in length
            *   122499   122535: gap of unknown length
            *   122536   165873: contig of 43338 bp in length
FEATURES             Location/Qualifiers
     source          1..165873
                     /organism="Oryza sativa"
                     /cultivar="Nipponbare"
                     /sub_species="japonica"
                     /db_xref="taxon:4530"
                     /chromosome="3"
                     /clone="OSJNBb0060J21"
BASE COUNT    46732 a  35660 c  36160 g  47138 t    183 others
ORIGIN



Query Match 20.5%; Score 86.4; DB 2; Length 165873; 

Best Local Similarity 65.7%; Pred. No. 4e-10; 

Matches 140; Conservative 0; Mismatches 72; Indels 1; Gaps 1; 

Qy 195 cggcgatctccaaaggggggaaggtanggccgaaccggtgcagcgcctgcaggaagaggg 254 

I I I I II I I I I I I I I I I I I I I I I II I I III Ml II M 

Db 33044 CGTCGCCGGCGACGGGGGAGCAGCCGGAGGCGAACCGGTGCGCGACGTGCCGGAGGAAGG 32985 

Qy 255 ttggacttacgggattcaactgccggtgtgggaacttgtactgcgcactccaccgctact 314 

I I I I I I I I I I I I I I I II I I I I I I I I I M I I I I I I I I I I I I I I I 
Db 32984 TGGGGCTGACGGGGTTCAAGTGCCGGTGCGGCGGCACGTTCTGCGGCGGCCACCGCTACG 32925 

Qy 315 ccgacaagcacgactgcaagttcgactaacggactgctgccaaggacgccattgccaagg 374 

I II I I I I I I M I I I I I I I II I I I II II I I I I I I I I I I I I 
Db 32924 CCGACGAGCACGGCTGCGGCTTCGACTACAAGAGCTCCGGGAGGGAGCTGATCGCCAA-G 32866 

Qy 375 gctaatccggtggtgaaggcagacaagctcgac 407 

I I I I I I I I II II I I II I I I I I I 
Db 32865 CAGAATCCGGTCGTCGTCGCCGACAAGCTGGCC 32833 



RESULT 7
AP003883/C
LOCUS       AP003883   136198 bp    DNA             HTG       10-JUL-2001
DEFINITION  Oryza sativa chromosome 8 clone OJ1134_H03, *** SEQUENCING IN
            PROGRESS ***, in ordered pieces.
ACCESSION   AP003883
VERSION     AP003883.1  GI:14646816
KEYWORDS    HTG; HTGS_PHASE2.
SOURCE      Oryza sativa (cultivar:Nipponbare) DNA, clone:OJ1134_H03.
  ORGANISM  Oryza sativa
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae;
            Ehrhartoideae; Oryzeae; Oryza.
REFERENCE   1  (bases 1 to 136198)
  AUTHORS   Sasaki,T., Matsumoto,T. and Yamamoto,K.
  TITLE     Oryza sativa nipponbare (GA3) genomic DNA, chromosome 8, BAC
            clone:OJ1134_H03
  JOURNAL   Published Only in Database (2001) In press
REFERENCE   2  (bases 1 to 136198)
  AUTHORS   Sasaki,T., Matsumoto,T. and Yamamoto,K.
  TITLE     Direct Submission
  JOURNAL   Submitted (09-JUL-2001) Takuji Sasaki, National Institute of
            Agrobiological Resources, Rice Genome Research Program; Kannondai
            2-1-2, Tsukuba, Ibaraki 305-8602, Japan
            (E-mail:tsasaki@abr.affrc.go.jp, URL:http://rgp.dna.affrc.go.jp/,
            Tel:81-298-38-7441, Fax:81-298-38-7468)
COMMENT     The nucleotide sequence of this BAC clone was generated by
            combining Monsanto and RGP-Japan sequencing data.
            NOTE: It currently consists of 1 contigs. Gaps between the contigs
            are represented as runs of N. The order of the pieces is believed
            to be correct as given, however the sizes of the gaps between them
            are based on estimates that have provided by the submitter. This
            sequence will be replaced by the finished sequence as soon as it is
            available and the accession number will be preserved.
            * NOTE: This is a 'working draft' sequence.
            * This sequence will be replaced
            * by the finished sequence as soon as it is available and
            * the accession number will be preserved.
FEATURES             Location/Qualifiers
     source          1..136198
                     /organism="Oryza sativa"
                     /cultivar="Nipponbare"
                     /db_xref="taxon:4530"
                     /chromosome="8"
                     /clone="OJ1134_H03"
BASE COUNT    40152 a  28322 c  28917 g  38606 t    201 others
ORIGIN



Query Match 20.0%; Score 84.4; DB 2; Length 136198; 

Best Local Similarity 68.0%; Pred. No. 1.2e-09; 

Matches 132; Conservative 0; Mismatches 61; Indels 1; Gaps 1; 

Qy 227 aaccggtgcagcgcctgcaggaagagggttggacttacgggattcaactgccggtgtggg 286 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 87218 AACCGATGCCACAGCTGCCGGAGGCGGGTGGGCCTGACCGGGTTCCGGTGCCGCTGCGGC 87159 

Qy 287 aacttgtactgcgcactccaccgctactccgacaagcacgactgcaagttcgactaacgg 346 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 87158 GAGCTCTACTGCGGCGCGCACCGGTACTCCGACCGCCACGACTGCAGCTTCGACTACAAG 87099 

Qy 347 actgctgccaaggacgccattgccaagggctaatccggtggtgaaggcagacaagctcga 406 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 87098 TCGGCGGCGAGGGACGCCATCGCC-AGGGAGAACCCCGTCGTCCGCGCCGCCAAGATCGT 87040 



Qy 407 caagatctaagggg 420 



Db 87039 TAGGTTCTAAAAGG 87026 



RESULT 8 

PHVPVPR3A 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
FEATURES 

source 



mRNA 
gene 
CDS 



primer_bind 



BASE COUNT 
ORIGIN 



PHVPVPR3A 871 bp mRNA PLN 12-AUG-1994 

P. vulgaris PVPR3 protein mRNA, complete cds.

M75856 

M75856.1 GI:169362
PVPR3 protein. 

Phaseolus vulgaris cDNA to mRNA. 
Phaseolus vulgaris 

Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots;
Rosidae; eurosids I; Fabales; Fabaceae; Papilionoideae; Phaseoleae;
Phaseolus.
1 (bases 1 to 871) 

Sharma,Y.K., Hinojos,C.M. and Mehdy,M.

cDNA cloning, structure and expression of a novel pathogenesis 
related protein in bean 

Mol. Plant Microb. Interact. 5, 89-95 (1992) 
Location/Qualifiers 
1. .871 

/organism="Phaseolus vulgaris" 

/db_xref="taxon:3885"

1. .871 

/gene="PVPR3" 

1. .871 

/gene="PVPR3"

116..529

/gene="PVPR3"

/codon_start=1

/protein_id="AAA33773.1"

/db_xref="GI:169363"

/translation="MAQKAQKNDTDFKVPEPITPCATATAAAATSISEPSRFFDAATP
ATSSRSPKRSLPLEDAANADRTVASEPKRAVNRCSGCRRRVGLTGFRCRCGDLFCAEH
RYTDRHDCSYDYKTVGREAIARENPVVKAAKIVKV"
complement(195..212)
/gene="PVPR3"
/note="5' primer"
209 a 234 c 198 g 230 t 



Query Match 19.4%; Score 81.6; DB 8; Length 871; 

Best Local Similarity 63.3%; Pred. No. 1e-08;

Matches 140; Conservative 0; Mismatches 80; Indels 1; Gaps 1; 

Qy 196 ggcgatctccaaaggggggaaggtanggccgaaccggtgcagcgcctgcaggaagagggt 255 

I I I I I I I MM I M M M M M I M I M I I I I 

Db 310 GACGGTGGCATCGGAGCCGAAGCGCGCGGTGAACCGGTGCTCGGGATGCCGGCGGCGCGT 369 

Qy 256 tggacttacgggattcaactgccggtgtgggaacttgtactgcgcactccaccgctactc 315 

M M M M M M M I M M I M M I I M M M M M I M I I 
Db 370 CGGGCTGACCGGATTCCGCTGTCGGTGCGGTGACCTTTTCTGCGCTGAGCACCGGTACAC 429



Qy 316 cgacaagcacgactgcaagttcgactaacggactgctgccaaggacgccattgccaaggg 375 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 430 GGACCGCCACGACTGCTCCTACGACTACAAGACCGTCGGGAGAGAGGCCATCG-CGAGGG 488

Qy 376 ctaatccggtggtgaaggcagacaagctcgacaagatctaa 416 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 489 AGAATCCCGTCGTGAAGGCCGCGAAGATCGTGAAGGTCTGA 529



RESULT 9 

AC090871 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



AC090871 165873 bp DNA HTG 28-JUN-2001 

Oryza sativa chromosome 3 clone OSJNBb0060J21, *** SEQUENCING IN
PROGRESS ***, 5 unordered pieces.
AC090871

AC090871.7 GI:14573697
HTG; HTGS_PHASE1.
Oryza sativa. 
Oryza sativa 

Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; 
Ehrhartoideae; Oryzeae; Oryza. 

1 (bases 1 to 165873) 

Buell,R., Hsiao,J., Zismann,V., Moffat,K.M., Hill,J.,
Gansberger,K., Burgess,S., Jarrahi,B., Shvartsbeyn,M., Brenner,M.,
Ciecko,A., Pai,G., Vanaken,S., Hansen,C., Utterbach,T.,
Feldblyum,T., Khalak,H.G., Yuan,Q., Quackenbush,J., White,O.,
Salzberg,S. and Fraser,C.

Oryza sativa ssp. japonica cv. Nipponbare OSJNBb0060J21 BAC genomic

sequence 

Unpublished 

2 (bases 1 to 165873) 
Buell, R. 

Direct Submission 

Submitted (13-MAR-2001) The Institute for Genomic Research, 9712
Medical Center Dr, Rockville, MD 20850, USA 

On Jun 28, 2001 this sequence version replaced gi:14333942. 

* NOTE: This is a 'working draft' sequence. It currently

* consists of 5 contigs. The true order of the pieces 

* is not known and their order in this sequence record is 

* arbitrary. Gaps between the contigs are represented as 

* runs of N, but the exact sizes of the gaps are unknown. 

* This record will be updated with the finished sequence 

* as soon as it is available and the accession number will 

* be preserved. 



FEATURES 

source 





* 1 11885: contig of 11885 bp in length
* 11886 11923: gap of unknown length
* 11924 28536: contig of 16613 bp in length
* 28537 28574: gap of unknown length
* 28575 95942: contig of 67368 bp in length
* 95943 95979: gap of unknown length
* 95980 122498: contig of 26519 bp in length
* 122499 122535: gap of unknown length
* 122536 165873: contig of 43338 bp in length
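
The contig table in the comment above can be cross-checked with simple coordinate arithmetic: for 1-based inclusive coordinates, a segment's length is end - start + 1. The sketch below is only a consistency check on the numbers printed in this record. The `segments` list is typed in by hand from the table, and the variable names are illustrative.

```python
# (start, end) pairs for the five contigs listed in the record's comment.
segments = [
    (1, 11885),
    (11924, 28536),
    (28575, 95942),
    (95980, 122498),
    (122536, 165873),
]

# Inclusive 1-based coordinates: length = end - start + 1.
lengths = [end - start + 1 for start, end in segments]

# Lengths stated in the record, in the same order.
stated = [11885, 16613, 67368, 26519, 43338]

for (start, end), computed, expected in zip(segments, lengths, stated):
    status = "ok" if computed == expected else "MISMATCH"
    print(f"{start}..{end}: computed {computed}, stated {expected} [{status}]")
```

The same check shows that the third contig must end at 95942 for its stated length of 67368 bp to hold.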




Location/Qualifiers 






1..165873

/organism="Oryza sativa"
/cultivar="Nipponbare"
/sub_species="japonica"
/db_xref="taxon:4530"
/chromosome="3"
/clone="OSJNBb0060J21"
BASE COUNT 46732 a 35660 c 36160 g 47138 t 183 others 
ORIGIN 



Query Match 18.9%; Score 79.6; DB 2; Length 165873; 

Best Local Similarity 67.7%; Pred. No. 1.4e-08; 

Matches 126; Conservative 0; Mismatches 59; Indels 1; Gaps 1; 

Qy 223 gccgaaccggtgcagcgcctgcaggaagagggttggacttacgggattcaactgccggtg 282 

I I I I I I I I I I I I I II III I I I I I I III II II I I I I I I I I I I I I I I I I I I 
Db 123242 GCCGAACCGGTGCGCGGCGTGCCGGAAGAAGGTGGGGCTGACGGGGTTCAAGTGCCGGTG 123301 

Qy 283 tgggaacttgtactgcgcactccaccgctactccgacaagcacgactgcaagttcgacta 342

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 123302 CGGCGGCAACTTCTGTGGCGGCCACCGCCACGCCGACGCCCACGGCTGCGGCTTCGACTA 123361 

Qy 343 acggactgctgccaaggacgccattgccaagggctaatccggtggtgaaggcagacaagc 402

II 1 I I I 1 I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 123362 CAAGAGCGCCGGGAAGGAGCAGATCGCCAA-GCAGAACCCGCTCGTCGTCGCCGACAAGC 123420 

Qy 403 tcgaca 408 
I I I I 

Db 123421 TGGCCA 123426 



RESULT 10 
AP003239/C 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



AP003239 151085 bp DNA PLN 07-MAR-2001 

Oryza sativa genomic DNA, chromosome 1, PAC clone:P0403C05,
complete sequence. 
AP003239 

AP003239.2 GI:13365491
HTG. 

Oryza sativa (cultivar:Nipponbare) DNA, clone:P0403C05.
Oryza sativa

Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; 
Ehrhartoideae; Oryzeae; Oryza. 

1 (sites) 

Sasaki,T., Matsumoto,T. and Yamamoto,K.

Oryza sativa nipponbare (GA3) genomic DNA, chromosome 1, PAC
clone:P0403C05 

Published Only in DataBase (2001) In press 

2 (bases 1 to 151085) 

Sasaki,T., Matsumoto,T. and Yamamoto,K.
Direct Submission

Submitted (19-FEB-2001) Takuji Sasaki, National Institute of
Agrobiological Resources, Rice Genome Research Program; Kannondai
2-1-2, Tsukuba, Ibaraki 305-8602, Japan

(E-mail:tsasaki@abr.affrc.go.jp, URL:http://rgp.dna.affrc.go.jp/,
Tel:81-298-38-7441, Fax:81-298-38-7468)

On Mar 16, 2001 this sequence version replaced gi:13027269.



The orientation of the sequence is from T7 to SP6 of the PAC clone. 
FEATURES Location/Qualifiers 
source 1. .151085 

/organism="Oryza sativa" 

/cultivar="Nipponbare" 

/db_xref="taxon:4530"

/chromosome="1"

/clone="P0403C05" 
BASE COUNT 41675 a 33792 c 35141 g 40477 t 
ORIGIN 



Query Match 18.6%; Score 78.4; DB 8; Length 151085; 

Best Local Similarity 62.4%; Pred. No. 2.7e-08; 

Matches 138; Conservative 0; Mismatches 82; Indels 1; Gaps 1; 

Qy 191 gtggcggcgatctccaaaggggggaaggtanggccgaaccggtgcagcgcctgcaggaag 250 

I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I II I I 

Db 17844 GAGCCGGCGACGGTAGTAGAGAAGAAGGCGCCGGCGAACCGGTGCGCGTCGTGCAAGAAG 17785

Qy 251 agggttggacttacgggattcaactgccggtgtgggaacttgtactgcgcactccaccgc 310 

I I I I I I I I I I I I I I I I I 11 I I I I I I II I I I I I II I I I 

Db 17784 AAGGTGGGACTGTTGGGGTTCGCATGCCGATGCGGGGCTACCTACTGCGGGACGCACCGA 17725 

Qy 311 tactccgacaagcacgactgcaagttcgactaacggactgctgccaaggacgccattgcc 370 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 17724 TACCCGGAGAAGCACGCCTGCGGCTTCGACTTCAAGGGCGCCAGCCGCGACGCCATAGCC 17665 

Qy 371 aagggctaatccggtggtgaaggcagacaagctcgacaaga 411 

I I I I I I I I I I I I I I I I I I I I I I II I 
Db 17664 -CGCGCCAACCCGCTCATCAAGGGCGAGAAGCTGACCAACA 17625



RESULT 11 

AP003453 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



AP003453 151100 bp DNA PLN 10-AUG-2001 

Oryza sativa genomic DNA, chromosome 1, PAC clone:P0480C01,
complete sequence. 
AP003453 

AP003453.2 GI:15146362
HTG. 

Oryza sativa (cultivar:Nipponbare) DNA, clone:P0480C01.
Oryza sativa

Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae;
Ehrhartoideae; Oryzeae; Oryza. 

1 (bases 1 to 151100) 

Sasaki, T., Matsumoto,T. and Yamamoto,K. 

Oryza sativa nipponbare (GA3) genomic DNA, chromosome 1, PAC
clone:P0480C01 

Published Only in Database (2001) In press 

2 (bases 1 to 151100) 

Sasaki, T., Matsumoto,T. and Yamamoto,K. 
Direct Submission 

Submitted (28-MAR-2001) Takuji Sasaki, National Institute of
Agrobiological Resources, Rice Genome Research Program; Kannondai
2-1-2, Tsukuba, Ibaraki 305-8602, Japan

(E-mail:tsasaki@nias.affrc.go.jp, URL:http://rgp.dna.affrc.go.jp/,
Tel:81-298-38-7441, Fax:81-298-38-7468)
COMMENT On Aug 9, 2001 this sequence version replaced gi:13486922.

The orientation of the sequence is from T7 to SP6 of the PAC clone. 
FEATURES Location/Qualifiers 
source 1. .151100 

/organism="Oryza sativa" 
/cultivar="Nipponbare" 
/db_xref="taxon:4530" 
/chromosome="1"
/clone="P0480C01" 
BASE COUNT 42198 a 33215 c 33471 g 42216 t 
ORIGIN 



Query Match 18.6%; Score 78.4; DB 8; Length 151100; 

Best Local Similarity 64.4%; Pred. No. 2.7e-08; 

Matches 132; Conservative 0; Mismatches 72; Indels 1; Gaps 1; 

Qy 212 gggaaggtanggccgaaccggtgcagcgcctgcaggaagagggttggacttacgggattc 271 

III I I I I i I II I I I I I I I I I I I I I I I I E I I I I I I I I I 

Db 108804 GGGGCGAAGACGAAGAGCAGGTGCGCTGCGTGCGGGAGGAGCGTGGGGCTGATGGGGTTC 108863 

Qy 272 aactgccggtgtgggaacttgtactgcgcactccaccgctactccgacaagcacgactgc 331 

I I I I I I II II I III Mill II II II I I I I I I I I I I I I I I I I II 

Db 108864 GAGTGCCGCTGCGGTGCCGTGTTCTGCGGCGCGCATCGCTACTCGGACAGGCACGACTGC 108923 

Qy 332 aagttcgactaacggactgctgccaaggacgccattgccaagggctaatccggtggtgaa 391 

I | I I I I II III II I I I I I I I I I I I I I I I I II I I I I 

Db 108924 GGCTACGATTACAGGGGCGCCGGGCGCGACGCCATCGCC-CGCGCCAACCCCGTCGTGAG 108982 

Qy 392 ggcagacaagctcgacaagatctaa 416 

I 1(1111 I II III Ml I 
Db 108983 ACCCGACAAGGTGGAGAAGCTCTGA 109007



RESULT 12 
AP003288/C 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
REFERENCE 
AUTHORS 
TITLE 



AP003288 159497 bp DNA HTG 21-FEB-2001 

Oryza sativa chromosome 1 clone P0683B11, *** SEQUENCING IN 
PROGRESS ***, in ordered pieces. 
AP003288 

AP003288.1 GI:13027318 
HTG; HTGS_PHASE2. 

Oryza sativa (cultivar:Nipponbare) DNA, clone:P0683B11.
Oryza sativa

Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; 
Ehrhartoideae; Oryzeae; Oryza. 

1 (sites) 

Sasaki,T., Matsumoto,T. and Yamamoto,K.

Oryza sativa nipponbare (GA3) genomic DNA, chromosome 1, PAC
clone:P0683B11

Published Only in DataBase (2001) In press 

2 (bases 1 to 159497) 

Sasaki,T., Matsumoto,T. and Yamamoto,K.
Direct Submission 



JOURNAL 



COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



Submitted (19-FEB-2001) Takuji Sasaki, National Institute of
Agrobiological Resources, Rice Genome Research Program; Kannondai
2-1-2, Tsukuba, Ibaraki 305-8602, Japan

(E-mail:tsasaki@abr.affrc.go.jp, URL:http://rgp.dna.affrc.go.jp/,
Tel:81-298-38-7441, Fax:81-298-38-7468)

NOTE: It currently consists of 1 contigs. Gaps between the contigs 
are represented as runs of N. The order of the pieces is believed 
to be correct as given, however the sizes of the gaps between them 
are based on estimates that have provided by the submitter. This 
sequence will be replaced by the finished sequence as soon as it is
available and the accession number will be preserved.

* NOTE: This is a 'working draft' sequence.

* This sequence will be replaced

* by the finished sequence as soon as it is available and

* the accession number will be preserved.

Location/Qualifiers
1..159497

/organism="Oryza sativa"
/cultivar="Nipponbare"
/db_xref="taxon:4530"
/chromosome="1"
/clone="P0683B11"
45304 a 34607 c 35698 g 43688 t 200 others



Query Match 18.6%; Score 78.4; DB 2; Length 159497;

Best Local Similarity 64.4%; Pred. No. 2.6e-08;

Matches 132; Conservative 0; Mismatches 72; Indels 1; Gaps 1;

Qy 212

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I

Db 144203

Qy 272 aactgccggtgtgggaacttgtactgcgcactccaccgctactccgacaagcacgactgc 331

I I I I I I II II I III I I I I I II I I I I I I I I I I I I I I I I II I I I I

Db 144143 GAGTGCCGCTGCGGTGCCGTGTTCTGCGGCGCGCATCGCTACTCGGACAGGCACGACTGC 144084

Qy 332 aagttcgactaacggactgctgccaaggacgccattgccaagggctaatccggtggtgaa 391

I I I I I I II Ell I I I M I I I I II I II II II II MM

Db 144083 GGCTACGATTACAGGGGCGCCGGGCGCGACGCCATCGCC-CGCGCCAACCCCGTCGTGAG 144025

Qy 392 ggcagacaagctcgacaagatctaa 416

I I I I I I I I II III III I
Db 144024 ACCCGACAAGGTGGAGAAGCTCTGA 144000



RESULT 13 

AY042871 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



AY042871 871 bp mRNA PLN 04-JUL-2001 

Arabidopsis thaliana zinc finger-like protein (F3C22_200) mRNA,
complete cds.
AY042871

AY042871.1 GI:14596166
FLI_CDNA. 
thale cress. 
Arabidopsis thaliana 



Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots;
Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis.
REFERENCE 1 (bases 1 to 871)

AUTHORS Nguyen,M., Karlin-Neumann,G., Southwick,A., Lam,B., Miranda,M.,
Palm,C.J., Bowser,L., Jones,T., Banh,J., Carninci,P., Chen,H.,
Cheuk,R., Chung,M.K., Hayashizaki,Y., Ishida,J., Kamiya,A.,
Kawai,J., Kim,C., Lin,J., Liu,S.X., Narusaka,M., Pham,P.K.,
Sakano,H., Sakurai,T., Satou,M., Seki,M., Shinn,P., Yamada,K.,
Shinozaki,K., Ecker,J., Theologis,A. and Davis,R.W.

TITLE Direct Submission 

JOURNAL Submitted (25-JUN-2001) DNA Sequencing and Technology Center,

Stanford University, 855 California Avenue, Palo Alto, CA 94304, 
USA 

COMMENT e-mail for correspondence: arab@sequence.stanford.edu 

RIKEN Genomic Sciences Center (GSC) members carried out the 
collection and clustering of RAFL cDNAs (RAFL cDNA: 'RIKEN
Arabidopsis Full-Length cDNA'): Seki,M., Narusaka,M., Ishida,J.,
Satou,M., Kamiya,A., Sakurai,T., Carninci,P., Kawai,J.,
Hayashizaki,Y. and Shinozaki,K.

The Salk, Stanford, PGEC (SSP) Consortium members carried out the
sequencing and annotation of the RAFL cDNAs: Nguyen,M.,
Southwick,A., Karlin-Neumann,G., Lam,B., Miranda,M., Palm,C.J.,
Bowser,L., Jones,T., Banh,J., Chen,H., Cheuk,R., Chung,M.K.,
Kim,C., Lin,J., Liu,S.X., Pham,P.K., Sakano,H., Shinn,P.,
Yamada,K., Ecker,J., Theologis,A. and Davis,R.W.

Nguyen,M. (SSP/Stanford) and Seki,M. (RIKEN GSC) contributed
equally to this work. Shinozaki,K. (RIKEN GSC) and Davis,R.W.
(SSP/Stanford) contributed equally to this work as PIs.
FEATURES Location/Qualifiers 
source 1. .871 

/organism="Arabidopsis thaliana"
/db_xref="taxon:3702"
/chromosome="III" 
/clone="RAFL04-17-A20" 
/note="This clone is in pBluescript 
ecotype: Columbia" 
gene 1. .871 

/gene="F3C22_200" 
CDS 93. .602 

/gene="F3C22_200" 
/codon_start=1

/product="zinc finger-like protein"
/protein_id="AAK68811.1"
/db_xref="GI:14596167"

/translation="MAEEHRCQTPESNRLCVNNCGFLGSATMNLCSNCYGDLCLKQQQ 
QSSSIKSTVESSLSVSPPSSSSSEISSPIIPPLLKNPSVKLEVPEKKAVISLPTTEQN 
QQQRPNRCTTCRKRVGLTGFKCRCGTMFCGVHRYPEIHGCSYDFKSAGREEIAKANPL 
VKAAKLQKI" 

BASE COUNT 216 a 208 c 190 g 257 t 

ORIGIN 



Query Match 17.2%; Score 72.6; DB 8; Length 871;

Best Local Similarity 64.1%; Pred. No. 1.1e-06;

Matches 125; Conservative 0; Mismatches 69; Indels 1; Gaps 1; 



Qy 222 ggccgaaccggtgcagcgcctgcaggaagagggttggacttacgggattcaactgccggt 281 

I I I I i I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I M I I I I 
Db 409 GGCCGAATCGGTGCACGACGTGTAGGAAACGGGTCGGGTTAACCGGATTCAAGTGCCGGT 468

Qy 282 gtgggaacttgtactgcgcactccaccgctactccgacaagcacgactgcaagttcgact 341 

I I I I III I I I I III I I I I I I I I II I I I I I I I I I I 

Db 469 GCGGTACGATGTTTTGCGGGGTTCATAGGTACCCGGAGATCCATGGATGCAGCTACGATT 528

Qy 342 aacggactgctgccaaggacgccattgccaagggctaatccggtggtgaaggcagacaag 401 

I II I III II II II I I I I I I I I I I I I I I I I I I I III 

Db 529 TCAAATCGGCCGGACGTGAAGAGATCGCGAA-AGCGAATCCGTTGGTGAAAGCAGCGAAG 587

Qy 402 ctcgacaagatctaa 416 

II I I II II I I 

Db 588 CTTCAGAAGATATGA 602 



RESULT 14 

ATF3C22 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

JOURNAL 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



FEATURES 

source 



misc_feature



ATF3C22 100028 bp DNA PLN 27-APR-2000

Arabidopsis thaliana DNA chromosome 3, BAC clone F3C22.
AL353912

AL353912.1 GI:7669934

thale cress.
Arabidopsis thaliana

Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots;
Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis.

1 (bases 1 to 100028)
Purnelle,B., Masuy,D., Goffeau,A., Boutry,M., Mewes,H.W., Rudd,S.,
Lemcke,K., Mayer,K.F.X., Quetier,F. and Salanoubat,M.
Unpublished 

2 (bases 1 to 100028) 

EU Arabidopsis sequencing, project.
Direct Submission

Submitted (27-APR-2000) MIPS, at the Max-Planck-Institut fuer
Biochemie, Am Klopferspitz 18a, D-82152 Martinsried, FRG, E-mail:
lemcke@mips.biochem.mpg.de, mayer@mips.biochem.mpg.de Project
Coordinator: Marcel Salanoubat and Francis Quetier, Groupement
d'Interet Public, Centre National de Sequencage - GENOSCOPE; 2 rue
Gaston Cremieux, BP191, 91006 Evry Cedex, France;
http://www.genoscope.cns.fr

Information on performance of analysis and a more detailed 
annotation of this entry and other sequences of chromosomes 3, 4 
and 5 can be viewed at: http://www.mips.biochem.mpg.de/proj/thal/. 

Location/Qualifiers

1..100028

/organism="Arabidopsis thaliana"
/variety="Columbia"
/db_xref="taxon:3702"
/chromosome="3"
1..31726

/note="overlap to BAC F2206, EMBL:AL050300; please refer
to this entry for analysis and annotation"
gene 31023. .33027 

/gene="F3C22_10" 

CDS join(31023..31790,32144..32312,32567..33027)

/gene="F3C22_10"
/codon_start=1
/product="putative protein"
/protein_id="CAB89222.1"
/db_xref="GI:7669935"

/translation="MTRPVGGTEYSWCRAIDGGTGIAVIALLLSRTPKLQNLQNTLDK

LQIYHPTLRSNIRFDASANSFSFVVTSAADSHVEIHPFDSVSTAQIIRDSDDPCADPH 

RIILEHEMNKNTWINPHRWIKSESRVFIVSLYDLTDDGEQRILTFRLNTAAVDRTAAV 

TLLREFMKETAADGFGNGPVVAATETAVGLGKAIEELIPSGKGDKPFWARGIDVLGYS 

LNAFRFSNLNFVDAENSNRRSQLVRLKLDRDQTLKLVAGCKARGLKLWAALASSALIA 

AYSSKNLPPYQGEKYAVVTLSDCRSILEPPLTSNDFGFYHAGILHTHDLTGEEKLWDL 

AKRCYDSFTSSKNSNKQFTDMSDLNFLMCKAIENPNLTPSSSLRTAFISIFEDPVIDE

SPEPELASLGVQDYIGCASIHGVGPSVAVFDALRDGKLDCAFVYPSPLHSREQMDGLI 

QHMKTILLEGSASSF" 
exon 31023..31790
/gene="F3C22_10"
/number=1
intron 31791..32143
/gene="F3C22_10"
/number=1
exon 32144..32312
/gene="F3C22_10"
/number=2
intron 32313..32566
/gene="F3C22_10"
/number=2
exon 32567..33027
/gene="F3C22_10"
/number=3
exon 33593..33706
/gene="F3C22_20"
/number=1
gene 33593..34036
/gene="F3C22_20"
CDS join(33593..33706,33887..34036)
/gene="F3C22_20"
/note="similarity to phosphate acetyltransferase,
Staphylococcus aureus, EMBL:SAU271496"
/codon_start=1
/product="putative protein"
/protein_id="CAB89223.1"
/db_xref="GI:7669936"
/translation="MATDTTTFPEGDAEAELNPGQNKNFEVDPVSSEIASEKVQKIVE
KRENAKAKRDGARTLKNTIIVSAVIVAVAGAAFAITKKLKENK"
intron 33707..33886
/gene="F3C22_20"
/number=1
exon 33887..34036
/gene="F3C22_20"
/number=2
exon 34664..34922
/gene="F3C22_30"
/number=1



gene 34664..35076
/gene="F3C22_30"
CDS join(34664..34922,35012..35076)
/gene="F3C22_30"
/note="similarity to predicted proteins, Arabidopsis, Rice
and Drosophila"
/codon_start=1
/product="putative protein"
/protein_id="CAB89224.1"
/db_xref="GI:7669937"
/translation="MDTSSPSAFVNGALLRRFIGQKVRTVIQVTGSEIGSVVGKSTDD
LQIVVRGSSPPSPLTTYLEVIGIAESDNAIRAETWTNFGNTFDTQNYNELCKLANGEF
KHLFI"
intron 34923..35011
/gene="F3C22_30"
/number=1
exon 35012..35076
/gene="F3C22_30"
/number=2
gene 35558..38298
/gene="F3C22_40"
gene complement(join(35558..35822,36109..36553,36641..36725,
36916..37056,37163..37234,37453..37572,37659..37730,
37838..37970,38069..38142,38245..38298))
/gene="F3C22_40"
CDS complement(join(35558..35822,36109..36553,36641..36725,
36916..37056,37163..37234,37453..37572,37659..37730,
37838..37970,38069..38142,38245..38298))
/gene="F3C22_40"
/note="similarity to KIAA0253 protein, Homo sapiens,
SWISSPROT:Y253_HUMAN
Contains Phosphopantetheine attachment site AA250-265"
/codon_start=1
/product="putative protein"
/protein_id="CAB89225.1"
/db_xref="GI:7669938"
/translation="MKHGTYTSDVAEFNMVMETTKAGTHNSEACLQEGTCLPLGGYSV
WSSLPPISVSSSNNRKPVVLTVASMDTASFFRDKSFGADSPISGLVALLGAVDALSRV
DGISNLKKQLVFLVLTGETWGYLGSRRFLHELDLHSDAVAGLSNTSIETVLEIGSVGK
GLSGGINTFFAHKTRVSSVTNMTLDALKIAQDSLASKNIKILSADTANPGIPPSSLMA
FMRKNPQTSAVVLEDFDTNFVNKFYHSHLDDLSNINSSSVVAAASVVARTLYILASDN
KDTSNSALGSIHVNASFVEELLTCLLACEPGLSCNLVKDYISPTNTCPGNYAGVILGE
PSSKPYLGYVGDVSRFLWNFLADKTSVQKGNTTSVCSKGVCSKTDEVCIKAESNKEGT
CVVSTTRYVPAYSTRLKYNDGAWTILPQNSSDSMGMVDPVWTESNWDTLRVHVYTVQH
SAYDNAVLVAGITVTTLAYIGILAAKSIITKALKQD"
exon complement(35558..35822)
/gene="F3C22_40"
/number=1
intron complement(35823..36108)
/number=1
exon complement(36109..36553)
/gene="F3C22_40"
/number=2
intron complement(36554..36640)
/number=2
exon complement(36641..36725)
/gene="F3C22_40"



/number=3
intron complement(36726..36915)
/number=3
exon complement(36916..37056)
/gene="F3C22_40"
/number=4
intron complement(37057..37162)
/number=4
exon complement(37163..37234)
/gene="F3C22_40"
/number=5
intron complement(37235..37452)
/number=5
exon complement(37453..37572)
/gene="F3C22_40"
/number=6
intron complement(37573..37658)
/number=6
exon complement(37659..37730)
/gene="F3C22_40"
/number=7
intron complement(37731..37837)
/number=7
exon complement(37838..37970)
/gene="F3C22_40"
/number=8
intron complement(37971..38068)
/number=8
exon complement(38069..38142)
/gene="F3C22_40"
/number=9
intron complement(38143..38244)
/number=9
exon complement(38245..38298)
/gene="F3C22_40"
/number=10
exon complement(40450..40595)
/gene="F3C22_50"
/number=1
gene 40450..40770
/gene="F3C22_50"
gene complement(join(40450..40595,40686..40770))
/gene="F3C22_50"
CDS complement(join(40450..40595,40686..40770))
/gene="F3C22_50"
/codon_start=1
/product="hypothetical protein"
/protein_id="CAB89226.1"
/db_xref="GI:7669939"
/translation="MAMGLIRLLSIAFTLVLLSILPLHLSLADEITSIESVPDLQKLM
YVAVDGFPCVRLLNLSGEIGCSSKLLLGLSAT"



Query Match 17.2%; Score 72.6; DB 8; Length 100028; 

Best Local Similarity 64.1%; Pred. No. 5.9e-07; 

Matches 125; Conservative 0; Mismatches 69; Indels 1; Gaps 1; 



Qy 222 ggccgaaccggtgcagcgcctgcaggaagagggttggacttacgggattcaactgccggt 281 



Db 90605 GGCCGAATCGGTGCACGACGTGTAGGAAACGGGTCGGGTTAACCGGATTCAAGTGCCGGT 90664

Qy 282 gtgggaacttgtactgcgcactccaccgctactccgacaagcacgactgcaagttcgact 341

I I I I III I I I I III I I I I I II I III I I I I I I I I I 

Db 90665 GCGGTACGATGTTTTGCGGGGTTCATAGGTACCCGGAGATCCATGGATGCAGCTACGATT 90724

Qy 342 aacggactgctgccaaggacgccattgccaagggctaatccggtggtgaaggcagacaag 401 

I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I III 

Db 90725 TCAAATCGGCCGGACGTGAAGAGATCGCGAA-AGCGAATCCGTTGGTGAAAGCAGCGAAG 90783

Qy 402 ctcgacaagatctaa 416 

II I I I I I I I I 

Db 90784 CTTCAGAAGATATGA 90798



RESULT 15 

AP002044 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 



JOURNAL 
MEDLINE 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



AP002044 21200 bp DNA PLN 19-JUL-2000 

Arabidopsis thaliana genomic DNA, chromosome 3, P1 clone:MMF12.
AP002044

AP002044.1 GI:8051652

Arabidopsis thaliana (strain:Columbia) DNA, clone_lib:Mitsui P1

clone:MMF12. 

Arabidopsis thaliana 

Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots;
Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. 

1 (sites) 

Kaneko,T., Katoh,T., Sato,S., Nakamura,A., Asamizu,E. and Tabata,S.
Structural analysis of Arabidopsis thaliana chromosome 3. II.
Sequence features of the 4,251,695 bp regions covered by 90 P1, TAC
and BAC clones 

DNA Res. 7 (3), 217-221 (2000) 
20363099 

2 (bases 1 to 21200)

Kaneko,T., Kato,T., Sato,S., Nakamura,Y., Asamizu,E. and Tabata,S.
Direct Submission

Submitted (16-MAY-2000) to the DDBJ/EMBL/GenBank databases.
Yasukazu Nakamura, Kazusa DNA Research Institute, Department of
Plant Gene Research; 1532-3, Yana, Kisarazu, Chiba 292-0812, Japan
(E-mail:ynakamu@kazusa.or.jp, Tel:81-438-52-3935,
Fax:81-438-52-3934) 

Address for correspondence: kaos@kazusa.or.jp 

For the latest information on annotation of this clone, please see 

http://www.kazusa.or.jp/kaos/cgi-bin/agd_graph.cgi?c=MMF12

Genes with similarity to proteins in the databases are described in 

'product' or 'note' qualifiers. Genes that have no significant 
protein similarity are described as 'unknown protein'. 
The software programs used to predict genes include: Grail 

(Informatics Group, Oak Ridge National Laboratory, 
http://compbio.ornl.gov/Grail-1.3/),

GENSCAN (Chris Burge, MIT, http://CCR-081.mit.edu/GENSCAN.html), 
NetGene2 (S.M. Hebsgaard, et al., CBS, Technical University of 
Denmark, http://www.cbs.dtu.dk/services/NetGene2/) and
SplicePredictor (Volker Brendel, Stanford University,
http://gremlin1.zool.iastate.edu/cgi-bin/sp.cgi).
Genes encoding tRNAs are predicted by tRNAscan-SE
(Sean Eddy, Washington University School of Medicine, St. Louis,
http://genome.wustl.edu/eddy/tRNAscan-SE/).

This sequence may not be the entire insert of this clone. It may be
shorter because we remove overlaps between neighboring submissions.
The 5' clone is T16H11 and the 3' clone is MBK21.

FEATURES Location/Qualifiers
source 1..21200
/organism="Arabidopsis thaliana"
/strain="Columbia"
/db_xref="taxon:3702"
/chromosome="3"
/clone="MMF12"
/clone_lib="Mitsui P1"
CDS complement(697..810)
/note="gb|AAF23278.1
gene_id:MMF12.1
similar to unknown protein"
/codon_start=1
/evidence=not_experimental
/protein_id="BAB02249.1"
/db_xref="GI:9294352"
/translation="MIDDQDLGFIANFLGIFIFALVIAYHYVTADPKYEAT"
tRNA complement(1205..1280)
/note="gene_id:MMF12.2"
/evidence=not_experimental
/product="tRNA-Arg(CCG)"
CDS complement(join(2584..2658,2797..2865,2976..3036,
3134..3594,3792..4409,4529..4804,4920..5117,5325..5435,
5520..5804,5889..5940,6069..6248,6667..6732,7268..7525,
7704..8192,8476..8526,8971..9050,9544..9751,10440..10543))
/note="gene_id:MMF12.3
unknown protein"
/codon_start=1
/evidence=not_experimental
/protein_id="BAB02250.1"
/db_xref="GI:9294353"

/translation="MNGSAQSACHVGYGTSYWAHSFVFRFNGPTFNIGSPGQSPGSSR
LLQLGAAGSASRLRSSSSKKPPEPLRRAVADCLSSSPPPVNSHHGAIPSMAPSEALRN 
LRDYLSASATTDLAYNMLLEHTIAERDRSPAVVTRCVALLKRYILRYKPGEETLLQVD 
KFCVNLIAECDASLKQKSLPVLSAPAGASPLPVSSFASAALVKSLHYVRSLVALHIPR 
RSFQPAAFAGATLASRQLLPSLSSLLSKSFNSQLSPANAAESPQKKDAANLSVSNLSN 
IQEINAHEDTEYISSDLLNWRWVGELQLSSASSESERPVNLQDMNNCNLLEVGAAGLL 
VGDMEAKMKGQHWKYFGTAEMPYLEQLLQPASVTMITNSASARSHLRAITASKRTRAG 
PQQIWDDSTVNTFRPRARPLFQYRHYSEQQPLRLNPAEVGEVIAAVCSEASSTPSNQM 
TVSPQLTSKTGKPSMDVAVSVLIKLVIDMYVLDARIAAPLTLSMLEEMLCSTKAPCRI 
RVFDLILNLGVHAQLLEPMISDNATTIEEDYAQETYIDNENRLLLQGTRTKDLPKMSS 
TSSAIENFESWILKILFEILLLLVQVEEKEECVWASALSCLLYFICDRGKIRRNQLNG 
LDIRVIKALLGTSKRNSWSEVVHSKLICIMTNMFYQSPEPEGSNKAISSASNFLIDQV 
DLIGGVEYIFFEYSLATTREERRNLYSVLFDYVLHQINEACSSAGLSEYTDDEIQPLA 
VRLALADAPEAFYISVKLGVEGIGEILRRSIAAALSGFSNSERLNQLLANITEKFDTI 
IGSFTHLDKEFLHLKQITKSSKFMESILDLRNDISMSVNLAWATLHSLLHSERTTYRQ 
NGYIWLGDLLIAEISEESGGSIWLSIKDLQQKIAHCGTSDSLVTSDVPISIHLLCGLL 



KSRNSVIRWGFLFILERLLMRSKFLLDENETQRSTGGVATQDHKDKRLEKANAVIDIM 
SSALSLMAQINETDRINILKMCDILFSQLCLKVLSTDEDAVPNSADRNSKFDTSHRNS 
YKESVDEGDTKPRYNNVSVSTCETASMAAMLLRGQAIVPMQLVARVPAALFYWPLIQL 
AGAATDNIALGVAVGSKGRGNIPGATSDIRATLLLLLIGKCTADTVAFQEVGGEEFFR
ELLDDTDSRVAYYSSAFLLKRMMTEEPEKYQNMLQKLVFKAQQSNNEKLLENPYLQMC 
GILQLSNEL" 

join(10920..10990,11467..11581,11658..11741,11834.

/note="gb|AAF25994.1

gene_id:MMF12.4

similar to unknown protein"

/codon_start=1

/evidence=not_experimental

/protein_id="BAB02251.1"

/db_xref="GI:9294354"

/translation="MCDLVARTGRLQQRYEDGSRLVAGCIPFRYVNSDKDGNSESGKV

IQVLMISSSSGPGLLFPKGGWENDETVREAAAREAVEEAGVRGILMDFLGNYEFKSKS

HQDEFSPEGLCKAAMYALYVKEELATWPEHETRTRKWLTIEEAVESCRHPWMKDALVE

GFCKWHKEKMVKGEEITGEH"

complement(12772..13890)

/note="gene_id:MMF12.5"

/codon_start=1

/evidence=not_experimental

/product="DNA-damage-repair/toleration protein-like;
disease resistance protein; polygalacturonase
inhibitor-like protein"
/protein_id="BAB02252.1"
/db_xref="GI:9294355"

/translation="MRKLLASPFSSLLAVVFISVISVVRCCSPKDQTALNAFKSSLSE
PNLGIFNTWSENTDCCKEWYGISCDPDSGRVTDISLRGESEDAIFQKAGRSGYMSGSI
DPAVCDLTALTSLVLADWKGITGEIPPCITSLASLRILDLAGNKITGEIPAEIGKLSK
LAVLNLAENQMSGEIPASLTSLIELKHLELTENGITGVIPADFGSLKMLSRVLLGRNE
LTGSIPESISGMERLADLDLSKNHIEGPIPEWMGNMKVLSLLNLDCNSLTGPIPGSLL
SNSGLDVANLSRNALEGTIPDVFGSKTYLVSLDLSHNSLSGRIPDSLSSAKFVGHLDI
SHNKLCGRIPTGFPFDHLEATSFSDNQCLCGGPLTTSC"

complement(join(15621..15848,15936..16172,16323..16690,

16780..17104))

/note="gene_id:MMF12.6"

/codon_start=1

/evidence=not_experimental

/product="protein phosphatase 2C"

/protein_id="BAB02253.1"

/db_xref="GI:9294356"

/translation="MVSSATILRMVAPCWRRPSVKGDHSTRDANGRCDGLLWYKDSGN
HVAGEFSMSVIQANNLLEDHSKLESGPVSMFDSGPQATFVGVYDGHGGPEAARFVNKH
LFDNIRKFTSENHGMSANVITKAFLATEEDFLSLVRRQWQIKPQIASVGACCLVGIIC
SGLLYIANAGDSRVVLGRLEKAFKIVKAVQLSSEHNASLESVREELRSLHPNDPQIVV
LKHKVWRVKGIIQVSRSIGDAYLKKAEFNREPLLAKFRVPEVFHKPILRAEPAITVHK
IHPEDQFLIFASDGLWEHLSNQEAVDIVNTCPRNGIARKLIKTALREAAKKREMRYSD
LKKIDRGVRRHFHDDITVIVVFLDSHLVSRSTSRRPLLSISGGGDLAGPST"
18818..19300

/note="contains similarity to unknown protein

gb|AAA33773.1

gene_id:MMF12.7"

/codon_start=1

/evidence=not_experimental

/protein_id="BAB02254.1"

/db_xref="GI:9294357"

/translation="MAQRTEKEETEFKVLETLTTTTTTLCTNNCGVTANPATNNMCQK
CFNASLVSAAAGVVESGSILKRSARSVNLRSSPAKVVIRPREIDAVKKRDQQIVNRCS
GCRKKVGLTGFRCRCGELFCSEHRYSDRHDCSYDYKTAGREAIARENPVVKAAKMVKV"

exon 20567..20701

/note="CDS is reported in Acc# AB024033
contains similarity to poly(A)-binding protein
gene_id:MMF12.8"
/number=1

/evidence=not_experimental
exon 20882..20982

/note="CDS is reported in Acc# AB024033
contains similarity to poly(A)-binding protein
gene_id:MMF12.8"
/number=2

/evidence=not_experimental
exon 21064..21200

/note="CDS is reported in Acc# AB024033
contains similarity to poly(A)-binding protein
gene_id:MMF12.8"
/number=3

/evidence=not_experimental
BASE COUNT 6829 a 4011 c 4053 g 6307 t 

ORIGIN 



Query Match 16.4%; Score 69.2; DB 8; Length 21200; 

Best Local Similarity 63.7%; Pred. No. 4.3e-06; 

Matches 121; Conservative 0; Mismatches 68; Indels 1; Gaps 1; 

Qy   227 aaccggtgcagcgcctgcaggaagagggttggacttacgggattcaactgccggtgtggg 286
         ||||| ||    |  ||  ||||||  |||||  | || |||||||  ||||| || ||
Db 19112 AACCGATGTTCTGGTTGTCGGAAGAAAGTTGGGTTGACCGGATTCAGGTGCCGATGCGGT 19171

Qy   287 aacttgtactgcgcactccaccgctactccgacaagcacgactgcaagttcgactaacgg 346
          |  | | |||| ||   ||||| ||||||||    || || || |  | ||||||
Db 19172 GAGCTTTTCTGCTCAGAGCACCGTTACTCCGATCGTCATGATTGTAGCTACGACTACAAA 19231

Qy   347 actgctgccaaggacgccattgccaagggctaatccggtggtgaaggcagacaagctcga 406
         || ||||     || || || | | || |  ||||||||||| ||||| |  ||  | |
Db 19232 ACCGCTGGTCGTGAGGCGATCG-CTAGAGAAAATCCGGTGGTCAAGGCTGCGAAAATGGT 19290

Qy   407 caagatctaa 416
         |||  | |||
Db 19291 CAAAGTTTAA 19300
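
The marker row between each Qy and Db line can be regenerated from the two sequence rows, which is useful when the bars have been damaged in extraction. The sketch below is an assumption about how such a row is drawn (a bar where the aligned bases are identical, ignoring case, and a blank otherwise); `match_line` is a hypothetical helper, not a GenCore function.

```python
def match_line(query: str, subject: str, gap: str = "-") -> str:
    """Return a marker row with '|' at identical aligned positions and ' ' elsewhere."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same number of columns")
    marks = []
    for q, s in zip(query.upper(), subject.upper()):
        # A gap character never counts as a match, even against another gap.
        marks.append("|" if q == s and q != gap else " ")
    return "".join(marks)


qy = "caagatctaa"  # Qy 407..416 in the alignment above
db = "CAAAGTTTAA"  # Db 19291..19300 in the alignment above
row = match_line(qy, db)
print(qy)
print(row)
print(db)
print(row.count("|"), "identical positions")
```

Summing the bar counts over all four segments of the alignment gives 121, which agrees with the "Matches 121" figure reported for this hit.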



Search completed: February 7, 2002, 11:06:26 
Job time: 9912 sec 

GenCore version 4.5 
Copyright (c) 1993 - 2000 Compugen Ltd. 



OM nucleic - nucleic search, using sw model

Run on: February 7, 2002, 11:00:01 ; Search time 428.31 Seconds
(without alignments)
842.693 Million cell updates/sec

Title: US-09-394-745-6489
Perfect score: 421
Sequence: 1 ggggccagcacgcgtccggc ctcgacaagatctaaggggg 421

Scoring table: IDENTITY_NUC
Gapop 10.0 , Gapext 1.0

Searched: 930621 seqs, 428662619 residues

Total number of hits satisfying chosen parameters: 1861242

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries
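
The throughput figure in this header can be reproduced from the other numbers it reports. The sketch below is an inference from the report itself, not documented behaviour: it assumes one cell update per query base per database residue, and a factor of two that would fit a search of both strands (the hit count 1861242 is exactly twice the 930621 sequences searched). All variable names are illustrative.

```python
QUERY_LENGTH = 421            # "Perfect score" / query length in this run
DB_RESIDUES = 428_662_619     # residues searched
DB_SEQUENCES = 930_621        # sequences searched
SEARCH_SECONDS = 428.31       # search time without alignments
STRANDS = 2                   # assumption: both strands of each entry are scored

cell_updates = QUERY_LENGTH * DB_RESIDUES * STRANDS
rate_millions = cell_updates / SEARCH_SECONDS / 1e6
candidate_hits = DB_SEQUENCES * STRANDS

print(f"{rate_millions:.3f} million cell updates/sec")
print(candidate_hits, "candidate hits")
```

Under these assumptions the rate rounds to the 842.693 reported above, and the hit count matches the total given in the header.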



Database: N_Geneseq_1101:*
1: /SIDS2/gcgdata/geneseq/geneseqn/NA1980.DAT:*
2: /SIDS2/gcgdata/geneseq/geneseqn/NA1981.DAT:*
3: /SIDS2/gcgdata/geneseq/geneseqn/NA1982.DAT:*
4: /SIDS2/gcgdata/geneseq/geneseqn/NA1983.DAT:*
5: /SIDS2/gcgdata/geneseq/geneseqn/NA1984.DAT:*
6: /SIDS2/gcgdata/geneseq/geneseqn/NA1985.DAT:*
7: /SIDS2/gcgdata/geneseq/geneseqn/NA1986.DAT:*
8: /SIDS2/gcgdata/geneseq/geneseqn/NA1987.DAT:*
9: /SIDS2/gcgdata/geneseq/geneseqn/NA1988.DAT:*
10: /SIDS2/gcgdata/geneseq/geneseqn/NA1989.DAT:
11: /SIDS2/gcgdata/geneseq/geneseqn/NA1990.DAT:
12: /SIDS2/gcgdata/geneseq/geneseqn/NA1991.DAT:
13: /SIDS2/gcgdata/geneseq/geneseqn/NA1992.DAT:
14: /SIDS2/gcgdata/geneseq/geneseqn/NA1993.DAT:
15: /SIDS2/gcgdata/geneseq/geneseqn/NA1994.DAT:
16: /SIDS2/gcgdata/geneseq/geneseqn/NA1995.DAT:
17: /SIDS2/gcgdata/geneseq/geneseqn/NA1996.DAT:
18: /SIDS2/gcgdata/geneseq/geneseqn/NA1997.DAT:
19: /SIDS2/gcgdata/geneseq/geneseqn/NA1998.DAT:
20: /SIDS2/gcgdata/geneseq/geneseqn/NA1999.DAT:
21: /SIDS2/gcgdata/geneseq/geneseqn/NA2000.DAT:
22: /SIDS2/gcgdata/geneseq/geneseqn/NA2001.DAT:



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES

Result No.   Score   Query Match (%)   Length   DB   ID         Description
1            93.4    22.2              998      21   AAC39959   Arabidopsis thalia
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ALIGNMENTS 



RESULT 1 
AAC39959 

ID AAC39959 standard; DNA; 998 BP. 
XX 

AC AAC39959; 
XX 

DT 17-OCT-2000 (first entry) 
XX 



DE Arabidopsis thaliana DNA fragment SEQ ID NO: 26530. 
XX 

KW Hybridisation assay; genetic mapping; gene expression control; 

KW protein identification; signal transduction pathway; 

KW metabolic pathway; promoter; termination sequence; ss. 
XX 

OS Arabidopsis thaliana. 
XX 

PN EP1033405-A2. 
XX 

PD 06-SEP-2000. 
XX 

PF 25-FEB-2000; 2000EP-0301439.
XX
PR 25-FEB-1999; 99US-0121825.
PR 05-MAR-1999; 99US-0123180.
PR 09-MAR-1999; 99US-0123548.
PR 23-MAR-1999; 99US-0125788.
PR 25-MAR-1999; 99US-0126264.
PR 29-MAR-1999; 99US-0126785.
PR 01-APR-1999; 99US-0127462.
PR 06-APR-1999; 99US-0128234.
PR 08-APR-1999; 99US-0128714.
PR 16-APR-1999; 99US-0129845.
PR 19-APR-1999; 99US-0130077.
PR 21-APR-1999; 99US-0130449.
PR 23-APR-1999; 99US-0130510.
PR 23-APR-1999; 99US-0130891.
PR 28-APR-1999; 99US-0131449.
PR 30-APR-1999; 99US-0132048.
PR 30-APR-1999; 99US-0132407.
PR 04-MAY-1999; 99US-0132484.
PR 05-MAY-1999; 99US-0132485.
PR 06-MAY-1999; 99US-0132486.
PR 06-MAY-1999; 99US-0132487.
PR 07-MAY-1999; 99US-0132863.
PR 11-MAY-1999; 99US-0134256.
PR 14-MAY-1999; 99US-0134218.
PR 14-MAY-1999; 99US-0134219.
PR 14-MAY-1999; 99US-0134221.
PR 14-MAY-1999; 99US-0134370.
PR 18-MAY-1999; 99US-0134768.
PR 19-MAY-1999; 99US-0134941.
PR 20-MAY-1999; 99US-0135124.
PR 21-MAY-1999; 99US-0135353.
PR 24-MAY-1999; 99US-0135629.
PR 25-MAY-1999; 99US-0136021.
PR 27-MAY-1999; 99US-0136392.
PR 28-MAY-1999; 99US-0136782.
PR 01-JUN-1999; 99US-0137222.
PR 03-JUN-1999; 99US-0137528.
PR 04-JUN-1999; 99US-0137502.
PR 07-JUN-1999; 99US-0137724.
PR 08-JUN-1999; 99US-0138094.
PR 10-JUN-1999; 99US-0138540.
PR 10-JUN-1999; 99US-0138847.
PR 14-JUN-1999; 99US-0139119.
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99US-0139455 
99US-0139456 
99US-0139457 
99US-0139458 
99US-0139459 
99US-0139460 
99US-0139461 
99US-0139462 
99US-0139463 
99US-0139750 
99US-0139763 
99US-0139817 
99US-0139899 
99US-0140353 
99US-0140354 
99US-0140695 
99US-0140823 
99US-0140991 
99US-0141287 
99US-0141842 
99US-0142154 
99US-0142055 
99US-0142390 
99US-0142803 
99US-0142920 
99US-0142977 
99US-0143542 
99US-0143624 
99US-0144005 
99US-0144085 
99US-0144086 
99US-0144325 
99US-0144331 
99US-0144332 
99US-0144333 
99US-0144334 
99US-0144335 
99US-0144352 
99US-0144632 
99US-0144884 
99US-0144814 
99US-0145086 
99US-0145088 
99US-0145085 
99US-0145087 
99US-0145089 
99US-0145192 
99US-0145145 
99US-0145218 
99US-0145224 
99US-0145276 
99US-0145913 
99US-0145918 
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Query Match 22.2%; Score 93.4; DB 21; Length 998; 

Best Local Similarity 70.8%; Pred. No. 3.9e-16; 

Matches 138; Conservative 0; Mismatches 56; Indels 1; Gaps 

Qy 222 ggccgaaccggtgcagcgcctgcaggaagagggttggacttacgggattcaactgccggt 281
       | || | ||| || |   | ||||  ||| ||||||| || || |||||||| || || |
Db 461 gaccaagccgatgtactacttgcaataagcgggttggcctgactggattcaaatgtcgct 520

Qy 282 gtgggaacttgtactgcgcactccaccgctactccgacaagcacgactgcaagttcgact 341
       |||||| | | | ||||| |   ||||||||  | ||||  || ||||||   ||| | |
Db 521 gtgggagcctcttctgcggaacacaccgctatgcagacatacatgactgctccttcaatt 580

Qy 342 aacggactgctgccaaggacgccattgccaagggctaatccggtggtgaaggcagacaag 401
       | |   |||||||  | || || || || || ||| || ||||| ||||| ||||| |||
Db 581 accatgctgctgcgcaagaggcgatagctaa-ggcaaacccggttgtgaaagcagagaag 639

Qy 402 ctcgacaagatctaa 416
       || ||||| |||| |
Db 640 cttgacaaaatctga 654
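
The two percentages reported for this hit appear to follow from the other figures in the report. The sketch below is an inference from those numbers rather than a documented definition: it assumes Query Match is the score divided by the perfect score (421 for this query), and Best Local Similarity is the number of matches divided by the number of alignment columns. The function names are illustrative.

```python
def query_match_pct(score: float, perfect_score: float) -> float:
    """Score as a percentage of the perfect (self-match) score."""
    return 100.0 * score / perfect_score


def best_local_similarity_pct(matches: int, conservative: int,
                              mismatches: int, indels: int) -> float:
    """Identical columns as a percentage of all alignment columns."""
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# Figures reported for RESULT 1 (AAC39959) above.
qm = query_match_pct(93.4, 421)
bls = best_local_similarity_pct(matches=138, conservative=0, mismatches=56, indels=1)
print(f"Query Match {qm:.1f}%  Best Local Similarity {bls:.1f}%")
```

The same arithmetic also fits the GenEmbl hit earlier in the report (69.2/421 for 16.4%, and 121 of 190 columns for 63.7%).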



RESULT 2 
AAC52046 

ID AAC52046 standard; DNA; 1116 BP. 
XX 

AC AAC52046; 
XX 

DT 18-OCT-2000 (first entry) 
XX 

DE Arabidopsis thaliana DNA fragment SEQ ID NO: 70270. 
XX 

KW Hybridisation assay; genetic mapping; gene expression control; 
KW protein identification; signal transduction pathway; 



KW metabolic pathway; promoter; termination sequence; ss. 
XX 

OS Arabidopsis thaliana. 
XX 

PN EP1033405-A2. 
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AT ") A A A 1 

-0130891 


PR 28-APR-1999; 99US-0131449.
PR 30-APR-1999; 99US-0132048.
PR 30-APR-1999; 99US-0132407.
PR 04-MAY-1999; 99US-0132484.
PR 05-MAY-1999; 99US-0132485.
PR 06-MAY-1999; 99US-0132486.
PR 06-MAY-1999; 99US-0132487.
PR 07-MAY-1999; 99US-0132863.
PR 11-MAY-1999; 99US-0134256.
PR 14-MAY-1999; 99US-0134218.
PR 14-MAY-1999; 99US-0134219.
PR 14-MAY-1999; 99US-0134221.
PR 14-MAY-1999; 99US-0134370.
PR 18-MAY-1999; 99US-0134768.
PR 19-MAY-1999; 99US-0134941.
PR 20-MAY-1999; 99US-0135124.
PR 21-MAY-1999; 99US-0135353.
PR 24-MAY-1999; 99US-0135629.
PR 25-MAY-1999; 99US-0136021.
PR 27-MAY-1999; 99US-0136392.
PR 28-MAY-1999; 99US-0136782.
PR 01-JUN-1999; 99US-0137222.
PR 03-JUN-1999; 99US-0137528.
PR 04-JUN-1999; 99US-0137502.
PR 07-JUN-1999; 99US-0137724.
PR 08-JUN-1999; 99US-0138094.
PR 10-JUN-1999; 99US-0138540.
PR 10-JUN-1999; 99US-0138847.
PR 14-JUN-1999; 99US-0139119.
PR 16-JUN-1999; 99US-0139452.
PR 16-JUN-1999; 99US-0139453.
PR 17-JUN-1999; 99US-0139492.
PR 18-JUN-1999; 99US-0139454.



PR 18-JUN-1999; 99US-0139455.
PR 18-JUN-1999; 99US-0139456.
PR 18-JUN-1999; 99US-0139457.
PR 18-JUN-1999; 99US-0139458.
PR 18-JUN-1999; 99US-0139459.
PR 18-JUN-1999; 99US-0139460.
PR 18-JUN-1999; 99US-0139461.
PR 18-JUN-1999; 99US-0139462.
PR 18-JUN-1999; 99US-0139463.
PR 18-JUN-1999; 99US-0139750.
PR 18-JUN-1999; 99US-0139763.
PR 21-JUN-1999; 99US-0139817.
PR 22-JUN-1999; 99US-0139899.
PR 23-JUN-1999; 99US-0140353.
PR 23-JUN-1999; 99US-0140354.
PR 24-JUN-1999; 99US-0140695.
PR 28-JUN-1999; 99US-0140823.
PR 29-JUN-1999; 99US-0140991.
PR 30-JUN-1999; 99US-0141287.
PR 01-JUL-1999; 99US-0141842.
PR 01-JUL-1999; 99US-0142154.
PR 02-JUL-1999; 99US-0142055.
PR 06-JUL-1999; 99US-0142390.
PR 08-JUL-1999; 99US-0142803.
PR 09-JUL-1999; 99US-0142920.
PR 12-JUL-1999; 99US-0142977.
PR 13-JUL-1999; 99US-0143542.
PR 14-JUL-1999; 99US-0143624.
PR 15-JUL-1999; 99US-0144005.
PR 16-JUL-1999; 99US-0144085.
PR 16-JUL-1999; 99US-0144086.
PR 19-JUL-1999; 99US-0144325.
PR 19-JUL-1999; 99US-0144331.
PR 19-JUL-1999; 99US-0144332.
PR 19-JUL-1999; 99US-0144333.
PR 19-JUL-1999; 99US-0144334.
PR 19-JUL-1999; 99US-0144335.
PR 20-JUL-1999; 99US-0144352.
PR 20-JUL-1999; 99US-0144632.
PR 20-JUL-1999; 99US-0144884.
PR 21-JUL-1999; 99US-0144814.
PR 21-JUL-1999; 99US-0145086.
PR 21-JUL-1999; 99US-0145088.
PR 22-JUL-1999; 99US-0145085.
PR 22-JUL-1999; 99US-0145087.
PR 22-JUL-1999; 99US-0145089.
PR 22-JUL-1999; 99US-0145192.
PR 23-JUL-1999; 99US-0145145.
PR 23-JUL-1999; 99US-0145218.
PR 23-JUL-1999; 99US-0145224.
PR 26-JUL-1999; 99US-0145276.
PR 27-JUL-1999; 99US-0145913.
PR 27-JUL-1999; 99US-0145918.
PR 27-JUL-1999; 99US-0145919.
PR 28-JUL-1999; 99US-0145951.
PR 02-AUG-1999; 99US-0146386.
PR 02-AUG-1999; 99US-0146388.



PR 02-AUG-1999; 99US-0146389.
PR 03-AUG-1999; 99US-0147038.
PR 04-AUG-1999; 99US-0147204.
PR 04-AUG-1999; 99US-0147302.
PR 05-AUG-1999; 99US-0147192.
PR 05-AUG-1999; 99US-0147260.
PR 06-AUG-1999; 99US-0147303.
PR 06-AUG-1999; 99US-0147416.
PR 09-AUG-1999; 99US-0147493.
PR 09-AUG-1999; 99US-0147935.
PR 10-AUG-1999; 99US-0148171.
PR 11-AUG-1999; 99US-0148319.
PR 12-AUG-1999; 99US-0148341.
PR 13-AUG-1999; 99US-0148565.
PR 13-AUG-1999; 99US-0148684.
PR 16-AUG-1999; 99US-0149368.
PR 17-AUG-1999; 99US-0149175.
PR 18-AUG-1999; 99US-0149426.
PR 20-AUG-1999; 99US-0149722.
PR 20-AUG-1999; 99US-0149723.
PR 20-AUG-1999; 99US-0149929.
PR 23-AUG-1999; 99US-0149902.
PR 23-AUG-1999; 99US-0149930.
PR 25-AUG-1999; 99US-0150566.
PR 26-AUG-1999; 99US-0150884.
PR 27-AUG-1999; 99US-0151065.
PR 27-AUG-1999; 99US-0151066.
PR 27-AUG-1999; 99US-0151080.
PR 30-AUG-1999; 99US-0151303.
PR 31-AUG-1999; 99US-0151438.
PR 01-SEP-1999; 99US-0151930.
PR 07-SEP-1999; 99US-0152363.
PR 10-SEP-1999; 99US-0153070.
PR 13-SEP-1999; 99US-0153758.
PR 15-SEP-1999; 99US-0154018.
PR 16-SEP-1999; 99US-0154039.
PR 20-SEP-1999; 99US-0154779.
PR 22-SEP-1999; 99US-0155139.
PR 23-SEP-1999; 99US-0155486.
PR 24-SEP-1999; 99US-0155659.
PR 28-SEP-1999; 99US-0156458.
PR 29-SEP-1999; 99US-0156596.
PR 04-OCT-1999; 99US-0157117.
PR 05-OCT-1999; 99US-0157753.
PR 06-OCT-1999; 99US-0157865.
PR 07-OCT-1999; 99US-0158029.
PR 08-OCT-1999; 99US-0158232.
PR 12-OCT-1999; 99US-0158369.
PR 13-OCT-1999; 99US-0159293.
PR 13-OCT-1999; 99US-0159294.
PR 13-OCT-1999; 99US-0159295.
PR 14-OCT-1999; 99US-0159329.
PR 14-OCT-1999; 99US-0159330.
PR 14-OCT-1999; 99US-0159331.
PR 14-OCT-1999; 99US-0159637.
PR 14-OCT-1999; 99US-0159638.
PR 18-OCT-1999; 99US-0159584.



PR 21-OCT-1999; 99US-0160741.
PR 21-OCT-1999; 99US-0160767.
PR 21-OCT-1999; 99US-0160768.
PR 21-OCT-1999; 99US-0160770.
PR 21-OCT-1999; 99US-0160814.
PR 21-OCT-1999; 99US-0160815.
PR 22-OCT-1999; 99US-0160980.
PR 22-OCT-1999; 99US-0160981.
PR 22-OCT-1999; 99US-0160989.
PR 25-OCT-1999; 99US-0161404.
PR 25-OCT-1999; 99US-0161405.
PR 25-OCT-1999; 99US-0161406.
PR 26-OCT-1999; 99US-0161359.
PR 26-OCT-1999; 99US-0161360.
PR 26-OCT-1999; 99US-0161361.
PR 28-OCT-1999; 99US-0161920.
PR 28-OCT-1999; 99US-0161992.
PR 28-OCT-1999; 99US-0161993.
PR 29-OCT-1999; 99US-0162142.



Query Match 22.1%; Score 93; DB 21; Length 1116;
Best Local Similarity 70.3%; Pred. No. 5.2e-16;
Matches 137; Conservative 1; Mismatches 56; Indels 1; Gaps

Qy 222 ggccgaaccggtgcagcgcctgcaggaagagggttggacttacgggattcaactgccggt 281
       | || | ||| || |   | ||||  ||| ||||||| || || |||||||| || || |
Db 564 gaccaagccgatgtactacttgcaataagcgggttggcctgactggattcaaatgtcgct 623

Qy 282 gtgggaacttgtactgcgcactccaccgctactccgacaagcacgactgcaagttcgact 341
       |||||| | | | ||||| |   ||||||||  | ||||  || ||||||   ||| | |
Db 624 gtgggagcctcttctgcggaacacaccgctatgcagacatacatgactgctccttcaatt 683

Qy 342 aacggactgctgccaaggacgccattgccaagggctaatccggtggtgaaggcagacaag 401
       | |   ||:||||  | || || || || || ||| || ||||| ||||| ||||| |||
Db 684 accatgctsctgcgcaagaggcgatagctaa-ggcaaacccggttgtgaaagcagagaag 742

Qy 402 ctcgacaagatctaa 416
       || ||||| |||| |
Db 743 cttgacaaaatctga 757





RESULT 3 
AAC43443 

ID AAC43443 standard; DNA; 522 BP. 
XX 

AC AAC43443; 
XX 

DT 17-OCT-2000 (first entry) 
XX 

DE Arabidopsis thaliana DNA fragment SEQ ID NO: 39268. 
XX 

KW Hybridisation assay; genetic mapping; gene expression control; 
KW protein identification; signal transduction pathway; 
KW metabolic pathway; promoter; termination sequence; ss. 
XX 

OS Arabidopsis thaliana. 
XX 



PN EP1033405-A2.
XX
PD 06-SEP-2000.
XX
PF 25-FEB-2000; 2000EP-0301439.
XX












PR 25-FEB-1999; 99US-0121825.
PR 05-MAR-1999; 99US-0123180.
PR 09-MAR-1999; 99US-0123548.
PR 23-MAR-1999; 99US-0125788.
PR 25-MAR-1999; 99US-0126264.
PR 29-MAR-1999; 99US-0126785.
PR 01-APR-1999; 99US-0127462.
PR 06-APR-1999; 99US-0128234.
PR 08-APR-1999; 99US-0128714.
PR 16-APR-1999; 99US-0129845.
PR 19-APR-1999; 99US-0130077.
PR 21-APR-1999; 99US-0130449.
PR 23-APR-1999; 99US-0130510.
PR 23-APR-1999; 99US-0130891.
PR 28-APR-1999; 99US-0131449.
PR 30-APR-1999; 99US-0132048.
PR 30-APR-1999; 99US-0132407.
PR 04-MAY-1999; 99US-0132484.
PR 05-MAY-1999; 99US-0132485.
PR 06-MAY-1999; 99US-0132486.
PR 06-MAY-1999; 99US-0132487.
PR 07-MAY-1999; 99US-0132863.
PR 11-MAY-1999; 99US-0134256.
PR 14-MAY-1999; 99US-0134218.
PR 14-MAY-1999; 99US-0134219.
PR 14-MAY-1999; 99US-0134221.
PR 14-MAY-1999; 99US-0134370.
PR 18-MAY-1999; 99US-0134768.
PR 19-MAY-1999; 99US-0134941.
PR 20-MAY-1999; 99US-0135124.
PR 21-MAY-1999; 99US-0135353.
PR 24-MAY-1999; 99US-0135629.
PR 25-MAY-1999; 99US-0136021.
PR 27-MAY-1999; 99US-0136392.
PR 28-MAY-1999; 99US-0136782.
PR 01-JUN-1999; 99US-0137222.
PR 03-JUN-1999; 99US-0137528.
PR 04-JUN-1999; 99US-0137502.
PR 07-JUN-1999; 99US-0137724.
PR 08-JUN-1999; 99US-0138094.
PR 10-JUN-1999; 99US-0138540.
PR 10-JUN-1999; 99US-0138847.
PR 14-JUN-1999; 99US-0139119.
PR 16-JUN-1999; 99US-0139452.
PR 16-JUN-1999; 99US-0139453.
PR 17-JUN-1999; 99US-0139492.
PR 18-JUN-1999; 99US-0139454.
PR 18-JUN-1999; 99US-0139455.
PR 18-JUN-1999; 99US-0139456.
PR 18-JUN-1999; 99US-0139457.
PR 18-JUN-1999; 99US-0139458.
PR 18-JUN-1999; 99US-0139459.
PR 18-JUN-1999; 99US-0139460.
PR 18-JUN-1999; 99US-0139461.
PR 18-JUN-1999; 99US-0139462.
PR 18-JUN-1999; 99US-0139463.
PR 18-JUN-1999; 99US-0139750.
PR 18-JUN-1999; 99US-0139763.
PR 21-JUN-1999; 99US-0139817.
PR 22-JUN-1999; 99US-0139899.
PR 23-JUN-1999; 99US-0140353.
PR 23-JUN-1999; 99US-0140354.
PR 24-JUN-1999; 99US-0140695.
PR 28-JUN-1999; 99US-0140823.
PR 29-JUN-1999; 99US-0140991.
PR 30-JUN-1999; 99US-0141287.
PR 01-JUL-1999; 99US-0141842.
PR 01-JUL-1999; 99US-0142154.
PR 02-JUL-1999; 99US-0142055.
PR 06-JUL-1999; 99US-0142390.
PR 08-JUL-1999; 99US-0142803.
PR 09-JUL-1999; 99US-0142920.
PR 12-JUL-1999; 99US-0142977.
PR 13-JUL-1999; 99US-0143542.
PR 14-JUL-1999; 99US-0143624.
PR 15-JUL-1999; 99US-0144005.
PR 16-JUL-1999; 99US-0144085.
PR 16-JUL-1999; 99US-0144086.
PR 19-JUL-1999; 99US-0144325.
PR 19-JUL-1999; 99US-0144331.
PR 19-JUL-1999; 99US-0144332.
PR 19-JUL-1999; 99US-0144333.
PR 19-JUL-1999; 99US-0144334.
PR 19-JUL-1999; 99US-0144335.
PR 20-JUL-1999; 99US-0144352.
PR 20-JUL-1999; 99US-0144632.
PR 20-JUL-1999; 99US-0144884.
PR 21-JUL-1999; 99US-0144814.
PR 21-JUL-1999; 99US-0145086.
PR 21-JUL-1999; 99US-0145088.
PR 22-JUL-1999; 99US-0145085.
PR 22-JUL-1999; 99US-0145087.
PR 22-JUL-1999; 99US-0145089.
PR 22-JUL-1999; 99US-0145192.
PR 23-JUL-1999; 99US-0145145.
PR 23-JUL-1999; 99US-0145218.
PR 23-JUL-1999; 99US-0145224.
PR 26-JUL-1999; 99US-0145276.
PR 27-JUL-1999; 99US-0145913.
PR 27-JUL-1999; 99US-0145918.
PR 27-JUL-1999; 99US-0145919.
PR 28-JUL-1999; 99US-0145951.
PR 02-AUG-1999; 99US-0146386.
PR 02-AUG-1999; 99US-0146388.
PR 02-AUG-1999; 99US-0146389.
PR 03-AUG-1999; 99US-0147038.
PR 04-AUG-1999; 99US-0147204.
PR 04-AUG-1999; 99US-0147302.



PR 05-AUG-1999; 99US-0147192.
PR 05-AUG-1999; 99US-0147260.
PR 06-AUG-1999; 99US-0147303.
PR 06-AUG-1999; 99US-0147416.
PR 09-AUG-1999; 99US-0147493.
PR 09-AUG-1999; 99US-0147935.
PR 10-AUG-1999; 99US-0148171.
PR 11-AUG-1999; 99US-0148319.
PR 12-AUG-1999; 99US-0148341.
PR 13-AUG-1999; 99US-0148565.
PR 13-AUG-1999; 99US-0148684.
PR 16-AUG-1999; 99US-0149368.
PR 17-AUG-1999; 99US-0149175.
PR 18-AUG-1999; 99US-0149426.
PR 20-AUG-1999; 99US-0149722.
PR 20-AUG-1999; 99US-0149723.
PR 20-AUG-1999; 99US-0149929.
PR 23-AUG-1999; 99US-0149902.
PR 23-AUG-1999; 99US-0149930.
PR 25-AUG-1999; 99US-0150566.
PR 26-AUG-1999; 99US-0150884.
PR 27-AUG-1999; 99US-0151065.
PR 27-AUG-1999; 99US-0151066.
PR 27-AUG-1999; 99US-0151080.
PR 30-AUG-1999; 99US-0151303.
PR 31-AUG-1999; 99US-0151438.
PR 01-SEP-1999; 99US-0151930.
PR 07-SEP-1999; 99US-0152363.
PR 10-SEP-1999; 99US-0153070.
PR 13-SEP-1999; 99US-0153758.
PR 15-SEP-1999; 99US-0154018.
PR 16-SEP-1999; 99US-0154039.
PR 20-SEP-1999; 99US-0154779.
PR 22-SEP-1999; 99US-0155139.
PR 23-SEP-1999; 99US-0155486.
PR 24-SEP-1999; 99US-0155659.
PR 28-SEP-1999; 99US-0156458.
PR 29-SEP-1999; 99US-0156596.
PR 04-OCT-1999; 99US-0157117.
PR 05-OCT-1999; 99US-0157753.
PR 06-OCT-1999; 99US-0157865.
PR 07-OCT-1999; 99US-0158029.
PR 08-OCT-1999; 99US-0158232.
PR 12-OCT-1999; 99US-0158369.
PR 13-OCT-1999; 99US-0159293.
PR 13-OCT-1999; 99US-0159294.
PR 13-OCT-1999; 99US-0159295.
PR 14-OCT-1999; 99US-0159329.
PR 14-OCT-1999; 99US-0159330.
PR 14-OCT-1999; 99US-0159331.
PR 14-OCT-1999; 99US-0159637.
PR 14-OCT-1999; 99US-0159638.
PR 18-OCT-1999; 99US-0159584.
PR 21-OCT-1999; 99US-0160741.
PR 21-OCT-1999; 99US-0160767.
PR 21-OCT-1999; 99US-0160768.
PR 21-OCT-1999; 99US-0160770.



PR 21-OCT-1999; 99US-0160814.
PR 21-OCT-1999; 99US-0160815.
PR 22-OCT-1999; 99US-0160980.
PR 22-OCT-1999; 99US-0160981.
PR 22-OCT-1999; 99US-0160989.
PR 25-OCT-1999; 99US-0161404.
PR 25-OCT-1999; 99US-0161405.
PR 25-OCT-1999; 99US-0161406.
PR 26-OCT-1999; 99US-0161359.
PR 26-OCT-1999; 99US-0161360.
PR 26-OCT-1999; 99US-0161361.
PR 28-OCT-1999; 99US-0161920.
PR 28-OCT-1999; 99US-0161992.
PR 28-OCT-1999; 99US-0161993.
PR 29-OCT-1999; 99US-0162142.



Query Match 21.8%; Score 91.8; DB 21; Length 522; 

Best Local Similarity 70.3%; Pred. No. 8.9e-16; 

Matches 137; Conservative 0; Mismatches 57; Indels 1; Gaps 1; 

Qy 222 ggccgaaccggtgcagcgcctgcaggaagagggttggacttacgggattcaactgccggt 281
       | || | ||| || |   | ||||  ||| ||||||| || || |||||||| || || |
Db 329 gaccaagccgatgtactacttgcaataagcgggttggcctgactggattcaaatgtcgct 388

Qy 282 gtgggaacttgtactgcgcactccaccgctactccgacaagcacgactgcaagttcgact 341
       |||||| | | | ||||| |   ||||||||  | |||   || ||||||   ||| | |
Db 389 gtgggagcctcttctgcggaacacaccgctatgcagacgtacatgactgctccttcaatt 448

Qy 342 aacggactgctgccaaggacgccattgccaagggctaatccggtggtgaaggcagacaag 401
       | |   |||||||  | || || || || || ||| || ||||| ||||| ||||| |||
Db 449 accatgctgctgcgcaagaggcgatagctaa-ggcaaacccggttgtgaaagcagagaag 507

Qy 402 ctcgacaagatctaa 416
       || ||||| |||| |
Db 508 cttgacaaaatctga 522



RESULT 4 
AAC49898 

ID AAC49898 standard; DNA; 928 BP. 
XX 

AC AAC49898;
XX 

DT 18-OCT-2000 (first entry) 
XX 

DE Arabidopsis thaliana DNA fragment SEQ ID NO: 62851. 
XX 

KW Hybridisation assay; genetic mapping; gene expression control; 
KW protein identification; signal transduction pathway; 
KW metabolic pathway; promoter; termination sequence; ss. 
XX 

OS Arabidopsis thaliana. 
XX 

PN EP1033405-A2.
XX 

PD 06-SEP-2000. 
XX 



PF 25-FEB-2000; 2000EP-0301439.
XX












PR 25-FEB-1999; 99US-0121825.
PR 05-MAR-1999; 99US-0123180.
PR 09-MAR-1999; 99US-0123548.
PR 23-MAR-1999; 99US-0125788.
PR 25-MAR-1999; 99US-0126264.
PR 29-MAR-1999; 99US-0126785.
PR 01-APR-1999; 99US-0127462.
PR 06-APR-1999; 99US-0128234.
PR 08-APR-1999; 99US-0128714.
PR 16-APR-1999; 99US-0129845.
PR 19-APR-1999; 99US-0130077.
PR 21-APR-1999; 99US-0130449.
PR 23-APR-1999; 99US-0130510.
PR 23-APR-1999; 99US-0130891.
PR 28-APR-1999; 99US-0131449.
PR 30-APR-1999; 99US-0132048.
PR 30-APR-1999; 99US-0132407.
PR 04-MAY-1999; 99US-0132484.
PR 05-MAY-1999; 99US-0132485.
PR 06-MAY-1999; 99US-0132486.
PR 06-MAY-1999; 99US-0132487.
PR 07-MAY-1999; 99US-0132863.
PR 11-MAY-1999; 99US-0134256.
PR 14-MAY-1999; 99US-0134218.
PR 14-MAY-1999; 99US-0134219.
PR 14-MAY-1999; 99US-0134221.
PR 14-MAY-1999; 99US-0134370.
PR 18-MAY-1999; 99US-0134768.
PR 19-MAY-1999; 99US-0134941.
PR 20-MAY-1999; 99US-0135124.
PR 21-MAY-1999; 99US-0135353.
PR 24-MAY-1999; 99US-0135629.
PR 25-MAY-1999; 99US-0136021.
PR 27-MAY-1999; 99US-0136392.
PR 28-MAY-1999; 99US-0136782.
PR 01-JUN-1999; 99US-0137222.
PR 03-JUN-1999; 99US-0137528.
PR 04-JUN-1999; 99US-0137502.
PR 07-JUN-1999; 99US-0137724.
PR 08-JUN-1999; 99US-0138094.
PR 10-JUN-1999; 99US-0138540.
PR 10-JUN-1999; 99US-0138847.
PR 14-JUN-1999; 99US-0139119.
PR 16-JUN-1999; 99US-0139452.
PR 16-JUN-1999; 99US-0139453.
PR 17-JUN-1999; 99US-0139492.
PR 18-JUN-1999; 99US-0139454.
PR 18-JUN-1999; 99US-0139455.
PR 18-JUN-1999; 99US-0139456.
PR 18-JUN-1999; 99US-0139457.
PR 18-JUN-1999; 99US-0139458.
PR 18-JUN-1999; 99US-0139459.
PR 18-JUN-1999; 99US-0139460.
PR 18-JUN-1999; 99US-0139461.
PR 18-JUN-1999; 99US-0139462.



PR 18-JUN-1999; 99US-0139463.
PR 18-JUN-1999; 99US-0139750.
PR 18-JUN-1999; 99US-0139763.
PR 21-JUN-1999; 99US-0139817.
PR 22-JUN-1999; 99US-0139899.
PR 23-JUN-1999; 99US-0140353.
PR 23-JUN-1999; 99US-0140354.
PR 24-JUN-1999; 99US-0140695.
PR 28-JUN-1999; 99US-0140823.
PR 29-JUN-1999; 99US-0140991.
PR 30-JUN-1999; 99US-0141287.
PR 01-JUL-1999; 99US-0141842.
PR 01-JUL-1999; 99US-0142154.
PR 02-JUL-1999; 99US-0142055.
PR 06-JUL-1999; 99US-0142390.
PR 08-JUL-1999; 99US-0142803.
PR 09-JUL-1999; 99US-0142920.
PR 12-JUL-1999; 99US-0142977.
PR 13-JUL-1999; 99US-0143542.
PR 14-JUL-1999; 99US-0143624.
PR 15-JUL-1999; 99US-0144005.
PR 16-JUL-1999; 99US-0144085.
PR 16-JUL-1999; 99US-0144086.
PR 19-JUL-1999; 99US-0144325.
PR 19-JUL-1999; 99US-0144331.
PR 19-JUL-1999; 99US-0144332.
PR 19-JUL-1999; 99US-0144333.
PR 19-JUL-1999; 99US-0144334.
PR 19-JUL-1999; 99US-0144335.
PR 20-JUL-1999; 99US-0144352.
PR 20-JUL-1999; 99US-0144632.
PR 20-JUL-1999; 99US-0144884.
PR 21-JUL-1999; 99US-0144814.
PR 21-JUL-1999; 99US-0145086.
PR 21-JUL-1999; 99US-0145088.
PR 22-JUL-1999; 99US-0145085.
PR 22-JUL-1999; 99US-0145087.
PR 22-JUL-1999; 99US-0145089.
PR 22-JUL-1999; 99US-0145192.
PR 23-JUL-1999; 99US-0145145.
PR 23-JUL-1999; 99US-0145218.
PR 23-JUL-1999; 99US-0145224.
PR 26-JUL-1999; 99US-0145276.
PR 27-JUL-1999; 99US-0145913.
PR 27-JUL-1999; 99US-0145918.
PR 27-JUL-1999; 99US-0145919.
PR 28-JUL-1999; 99US-0145951.
PR 02-AUG-1999; 99US-0146386.
PR 02-AUG-1999; 99US-0146388.
PR 02-AUG-1999; 99US-0146389.
PR 03-AUG-1999; 99US-0147038.
PR 04-AUG-1999; 99US-0147204.
PR 04-AUG-1999; 99US-0147302.
PR 05-AUG-1999; 99US-0147192.
PR 05-AUG-1999; 99US-0147260.
PR 06-AUG-1999; 99US-0147303.
PR 06-AUG-1999; 99US-0147416.



PR 09-AUG-1999; 99US-0147493.
PR 09-AUG-1999; 99US-0147935.
PR 10-AUG-1999; 99US-0148171.
PR 11-AUG-1999; 99US-0148319.
PR 12-AUG-1999; 99US-0148341.
PR 13-AUG-1999; 99US-0148565.
PR 13-AUG-1999; 99US-0148684.
PR 16-AUG-1999; 99US-0149368.
PR 17-AUG-1999; 99US-0149175.
PR 18-AUG-1999; 99US-0149426.
PR 20-AUG-1999; 99US-0149722.
PR 20-AUG-1999; 99US-0149723.
PR 20-AUG-1999; 99US-0149929.
PR 23-AUG-1999; 99US-0149902.
PR 23-AUG-1999; 99US-0149930.
PR 25-AUG-1999; 99US-0150566.
PR 26-AUG-1999; 99US-0150884.
PR 27-AUG-1999; 99US-0151065.
PR 27-AUG-1999; 99US-0151066.
PR 27-AUG-1999; 99US-0151080.
PR 30-AUG-1999; 99US-0151303.
PR 31-AUG-1999; 99US-0151438.
PR 01-SEP-1999; 99US-0151930.
PR 07-SEP-1999; 99US-0152363.
PR 10-SEP-1999; 99US-0153070.
PR 13-SEP-1999; 99US-0153758.
PR 15-SEP-1999; 99US-0154018.
PR 16-SEP-1999; 99US-0154039.
PR 20-SEP-1999; 99US-0154779.
PR 22-SEP-1999; 99US-0155139.
PR 23-SEP-1999; 99US-0155486.
PR 24-SEP-1999; 99US-0155659.
PR 28-SEP-1999; 99US-0156458.
PR 29-SEP-1999; 99US-0156596.
PR 04-OCT-1999; 99US-0157117.
PR 05-OCT-1999; 99US-0157753.
PR 06-OCT-1999; 99US-0157865.
PR 07-OCT-1999; 99US-0158029.
PR 08-OCT-1999; 99US-0158232.
PR 12-OCT-1999; 99US-0158369.
PR 13-OCT-1999; 99US-0159293.
PR 13-OCT-1999; 99US-0159294.
PR 13-OCT-1999; 99US-0159295.
PR 14-OCT-1999; 99US-0159329.
PR 14-OCT-1999; 99US-0159330.
PR 14-OCT-1999; 99US-0159331.
PR 14-OCT-1999; 99US-0159637.
PR 14-OCT-1999; 99US-0159638.
PR 18-OCT-1999; 99US-0159584.
PR 21-OCT-1999; 99US-0160741.
PR 21-OCT-1999; 99US-0160767.
PR 21-OCT-1999; 99US-0160768.
PR 21-OCT-1999; 99US-0160770.
PR 21-OCT-1999; 99US-0160814.
PR 21-OCT-1999; 99US-0160815.
PR 22-OCT-1999; 99US-0160980.
PR 22-OCT-1999; 99US-0160981.



PR 22-OCT-1999; 99US-0160989.
PR 25-OCT-1999; 99US-0161404.
PR 25-OCT-1999; 99US-0161405.
PR 25-OCT-1999; 99US-0161406.
PR 26-OCT-1999; 99US-0161359.
PR 26-OCT-1999; 99US-0161360.
PR 26-OCT-1999; 99US-0161361.
PR 28-OCT-1999; 99US-0161920.
PR 28-OCT-1999; 99US-0161992.
PR 28-OCT-1999; 99US-0161993.
PR 29-OCT-1999; 99US-0162142.


Query Match 21.8%; Score 91.8; DB 21; Length 928;
Best Local Similarity 70.3%; Pred. No. 1e-15;
Matches 137; Conservative 0; Mismatches 57; Indels 1; Gaps 1;


Qy 222 ggccgaaccggtgcagcgcctgcaggaagagggttggacttacgggattcaactgccggt 281
       | || | ||| || |   | ||||  ||| ||||||| || || |||||||| || || |
Db 480 gaccaagccgatgtactacttgcaataagcgggttggcctgactggattcaaatgtcgct 539

Qy 282 gtgggaacttgtactgcgcactccaccgctactccgacaagcacgactgcaagttcgact 341
       |||||| | | | ||||| |   ||||||||  | |||   || ||||||   ||| | |
Db 540 gtgggagcctcttctgcggaacacaccgctatgcagacgtacatgactgctccttcaatt 599

Qy 342 aacggactgctgccaaggacgccattgccaagggctaatccggtggtgaaggcagacaag 401
       | |   |||||||  | || || || || || ||| || ||||| ||||| ||||| |||
Db 600 accatgctgctgcgcaagaggcgatagctaa-ggcaaacccggttgtgaaagcagagaag 658

Qy 402 ctcgacaagatctaa 416
       || ||||| |||| |
Db 659 cttgacaaaatctga 673



RESULT 5 
AAC49893 

ID AAC49893 standard; DNA; 986 BP. 
XX 

AC AAC49893;
XX 

DT 18-OCT-2000 (first entry) 
XX 

DE Arabidopsis thaliana DNA fragment SEQ ID NO: 62834.
XX 

KW Hybridisation assay; genetic mapping; gene expression control; 
KW protein identification; signal transduction pathway; 
KW metabolic pathway; promoter; termination sequence; ss. 
XX 

OS Arabidopsis thaliana. 
XX 

PN EP1033405-A2. 
XX 

PD 06-SEP-2000. 
XX 

PF 25-FEB-2000; 2000EP-0301439.
XX

PR 25-FEB-1999; 99US-0121825.
PR 05-MAR-1999; 99US-0123180.



PR 09-MAR-1999;
PR 23-MAR-1999;
PR 25-MAR-1999;
PR 29-MAR-1999;


PR 


01 


-APR- 
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PR 


06 


-APR- 


1999 


PR 08-APR-1999;
PR 16-APR-1999;
PR 19-APR-1999;
PR 21-APR-1999;
PR 23-APR-1999;
PR 23-APR-1999;
PR 28-APR-1999;
PR 30-APR-1999;
PR 30-APR-1999;
PR 04-MAY-1999;
PR 05-MAY-1999;
PR 06-MAY-1999;
PR 06-MAY-1999;
PR 07-MAY-1999;
PR 11-MAY-1999;
PR 14-MAY-1999;
PR 14-MAY-1999;
PR 14-MAY-1999;
PR 14-MAY-1999;
PR 18-MAY-1999;
PR 19-MAY-1999;
PR 20-MAY-1999;
PR 21-MAY-1999;
PR 24-MAY-1999;
PR 25-MAY-1999;
PR 27-MAY-1999;
PR 28-MAY-1999;
PR 01-JUN-1999;
PR 03-JUN-1999;
PR 04-JUN-1999;
PR 07-JUN-1999;
PR 08-JUN-1999;
PR 10-JUN-1999;
PR 10-JUN-1999;
PR 14-JUN-1999;
PR 16-JUN-1999;
PR 16-JUN-1999;
PR 17-JUN-1999;
PR 18-JUN-1999;
PR 18-JUN-1999;
PR 18-JUN-1999;
PR 18-JUN-1999;
PR 18-JUN-1999;
PR 18-JUN-1999;
PR 18-JUN-1999;
PR 18-JUN-1999;
PR 18-JUN-1999;
PR 18-JUN-1999;
PR 18-JUN-1999;
PR 18-JUN-1999;
PR 21-JUN-1999;



99US-0123548 
99US-0125788 
99US-0126264 
99US-0126785 
99US-0127462 
99US-0128234 
99US-0128714 
99US-0129845 
99US-0130077 
99US-0130449 
99US-0130510 
99US-0130891 
99US-0131449 
99US-0132048 
99US-0132407 
99US-0132484 
99US-0132485 
99US-0132486 
99US-0132487 
99US-0132863 
99US-0134256 
99US-0134218 
99US-0134219 
99US-0134221 
99US-0134370 
99US-0134768 
99US-0134941 
99US-0135124 
99US-0135353 
99US-0135629 
99US-0136021 
99US-0136392 
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PR 09-JUL-1999; 

PR 12-JUL-1999; 

PR 13-JUL-1999; 

PR 14-JUL-1999; 

PR 15-JUL-1999; 

PR 16-JUL-1999; 
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PR 19-JUL-1999; 

PR 19-JUL-1999; 
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PR 27-JUL-1999, 

PR 27-JUL-1999, 
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99US-0144333. 
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99US-0145951 
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Query Match 17.2%; Score 72.6; DB 21; Length 896; 

Best Local Similarity 64.1%; Pred. No. 1.7e-10; 

Matches 125; Conservative 0; Mismatches 69; Indels 1; Gaps 1; 



Qy 222 ggccgaaccggtgcagcgcctgcaggaagagggttggacttacgggattcaactgccggt 281
       ||||||| |||||||   | || |||||  |||| ||  | || |||||||| |||||||
Db 421 ggccgaatcggtgcacgacgtgtaggaaacgggtcgggttaaccggattcaagtgccggt 480

Qy 282 gtgggaacttgtactgcgcactccaccgctactccgacaagcacgactgcaagttcgact 341
       | || |   |||  ||||   | ||  | ||| | || |  || |  ||||  | ||| |
Db 481 gcggtacgatgttttgcggggttcataggtacccggagatccatggatgcagctacgatt 540

Qy 342 aacggactgctgccaaggacgccattgccaagggctaatccggtggtgaaggcagacaag 401
             | || |     || |  || || ||  || |||||| ||||||| ||||  |||
Db 541 tcaaatcggccggacgtgaagagatcgcgaa-agcgaatccgttggtgaaagcagcgaag 599

Qy 402 ctcgacaagatctaa 416
       ||  | ||||| | |
Db 600 cttcagaagatatga 614



RESULT 9
AAC36681
ID AAC36681 standard; DNA; 1155 BP.
XX
AC AAC36681;
XX
DT 17-OCT-2000 (first entry)
XX
DE Arabidopsis thaliana DNA fragment SEQ ID NO: 14694.
XX
KW Hybridisation assay; genetic mapping; gene expression control;
KW protein identification; signal transduction pathway;
KW metabolic pathway; promoter; termination sequence; ss.
XX
OS Arabidopsis thaliana.
XX
PN EP1033405-A2.
XX
PD 06-SEP-2000.
XX
PF 25-FEB-2000; 2000EP-0301439.
XX
PR 25-FEB-1999; 99US-0121825.
PR 05-MAR-1999; 99US-0123180.
PR 09-MAR-1999; 99US-0123548.
PR 23-MAR-1999; 99US-0125788.
PR 25-MAR-1999; 99US-0126264.
PR 29-MAR-1999; 99US-0126785.
PR 01-APR-1999; 99US-0127462.


PR 
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PR 
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99US-0128714 . 


PR 
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99US-0129845. 


PR 
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99US-0130077 . 


PR 
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99US-0130449. 


PR 
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99US-0130510. 


PR 


23-APR-1999, 


99US-0130891. 


PR 
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99US-0131449. 


PR 
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99US-0132048. 


PR 
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99US-0132487 
99US-0132863 
99US-0134256 
99US-0134218 
99US-0134219 
99US-0134221 
99US-0134370 
99US-0134768 
99US-0134941 
99US-0135124 
99US-0135353 
99US-0135629 
99US-0136021 
99US-0136392 
99US-0136782 
99US-0137222 
99US-0137528 
99US-0137502 
99US-0137724 
99US-0138094 
99US-0138540 
99US-0138847 
99US-0139119 
99US-0139452 
99US-0139453 
99US-0139492 
99US-0139454 
99US-0139455 
99US-0139456 
99US-0139457 
99US-0139458 
99US-0139459 
99US-0139460 
99US-0139461 
99US-0139462 
99US-0139463 
99US-0139750 
99US-0139763 
99US-0139817 
99US-0139899 
99US-0140353 
99US-0140354 
99US-0140695 
99US-0140823 
99US-0140991 
99US-0141287 
99US-0141842 
99US-0142154 
99US-0142055 
99US-0142390 
99US-0142803 
99US-0142920 
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99US-0143542 
99US-0143624 
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PR 26-JUL-1999, 

PR 27-JUL-1999, 

PR 27-JUL-1999, 
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99US-0144005. 

99US-0144085. 

99US-0144086. 

99US-0144325. 

99US-0144331. 

99US-0144332. 

99US-0144333. 

99US-0144334 . 

99US-0144335. 

99US-0144352. 

99US-0144632. 

99US-0144884. 

99US-0144814 . 

99US-0145086. 

99US-0145088. 
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99US-0145087 . 
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Query Match 17.2%; Score 72.6; DB 21; Length 1155; 

Best Local Similarity 64.1%; Pred. No. 1.8e-10; 

Matches 125; Conservative 0; Mismatches 69; Indels 1; Gaps 1; 



Qy 222 ggccgaaccggtgcagcgcctgcaggaagagggttggacttacgggattcaactgccggt 281
       ||||||| |||||||   | || |||||  |||| ||  | || |||||||| |||||||
Db 683 ggccgaatcggtgcacgacgtgtaggaaacgggtcgggttaaccggattcaagtgccggt 742

Qy 282 gtgggaacttgtactgcgcactccaccgctactccgacaagcacgactgcaagttcgact 341
       | || |   |||  ||||   | ||  | ||| | || |  || |  ||||  | ||| |
Db 743 gcggtacgatgttttgcggggttcataggtacccggagatccatggatgcagctacgatt 802

Qy 342 aacggactgctgccaaggacgccattgccaagggctaatccggtggtgaaggcagacaag 401
             | || |     || |  || || ||  || |||||| ||||||| ||||  |||
Db 803 tcaaatcggccggacgtgaagagatcgcgaa-agcgaatccgttggtgaaagcagcgaag 861

Qy 402 ctcgacaagatctaa 416
       ||  | ||||| | |
Db 862 cttcagaagatatga 876





RESULT 10 
AAC37406 

ID AAC37406 standard; DNA; 1028 BP. 
XX 

AC AAC37406; 
XX 

DT 17-OCT-2000 (first entry)
XX 

DE Arabidopsis thaliana DNA fragment SEQ ID NO: 17264. 
XX 

KW Hybridisation assay; genetic mapping; gene expression control; 
KW protein identification; signal transduction pathway; 
KW metabolic pathway; promoter; termination sequence; ss. 
XX 

OS Arabidopsis thaliana. 
XX 

PN EP1033405-A2. 



XX
PD 06-SEP-2000.
XX
PF 25-FEB-2000; 2000EP-0301439.
XX
PR 25-FEB-1999; 99US-0121825.
PR 05-MAR-1999; 99US-0123180.
PR 09-MAR-1999; 99US-0123548.
PR 23-MAR-1999; 99US-0125788.
PR 25-MAR-1999; 99US-0126264.
PR 29-MAR-1999; 99US-0126785.
PR 01-APR-1999; 99US-0127462.
PR 06-APR-1999; 99US-0128234.
PR 08-APR-1999; 99US-0128714.
PR 16-APR-1999; 99US-0129845.
PR 19-APR-1999; 99US-0130077.
PR 21-APR-1999; 99US-0130449.
PR 23-APR-1999; 99US-0130510.
PR 23-APR-1999; 99US-0130891.
PR 28-APR-1999; 99US-0131449.
PR 30-APR-1999; 99US-0132048.
PR 30-APR-1999; 99US-0132407.
PR 04-MAY-1999; 99US-0132484.
PR 05-MAY-1999; 99US-0132485.
PR 06-MAY-1999; 99US-0132486.
PR 06-MAY-1999; 99US-0132487.
PR 07-MAY-1999; 99US-0132863.



PR 11-MAY-1999; 99US-0134256.
PR 14-MAY-1999; 99US-0134218.
PR 14-MAY-1999; 99US-0134219.
PR 14-MAY-1999; 99US-0134221.
PR 14-MAY-1999; 99US-0134370.
PR 18-MAY-1999; 99US-0134768.
PR 19-MAY-1999; 99US-0134941.
PR 20-MAY-1999; 99US-0135124.
PR 21-MAY-1999; 99US-0135353.
PR 24-MAY-1999; 99US-0135629.
PR 25-MAY-1999; 99US-0136021.
PR 27-MAY-1999; 99US-0136392.
PR 28-MAY-1999; 99US-0136782.
PR 01-JUN-1999; 99US-0137222.
PR 03-JUN-1999; 99US-0137528.
PR 04-JUN-1999; 99US-0137502.
PR 07-JUN-1999; 99US-0137724.
PR 08-JUN-1999; 99US-0138094.
PR 10-JUN-1999; 99US-0138540.
PR 10-JUN-1999; 99US-0138847.
PR 14-JUN-1999; 99US-0139119.
PR 16-JUN-1999; 99US-0139452.
PR 16-JUN-1999; 99US-0139453.
PR 17-JUN-1999; 99US-0139492.
PR 18-JUN-1999; 99US-0139454.
PR 18-JUN-1999; 99US-0139455.
PR 18-JUN-1999; 99US-0139456.
PR 18-JUN-1999; 99US-0139457.
PR 18-JUN-1999; 99US-0139458.
PR 18-JUN-1999; 99US-0139459.
PR 18-JUN-1999; 99US-0139460.
PR 18-JUN-1999; 99US-0139461.
PR 18-JUN-1999; 99US-0139462.
PR 18-JUN-1999; 99US-0139463.
PR 18-JUN-1999; 99US-0139750.
PR 18-JUN-1999; 99US-0139763.
PR 21-JUN-1999; 99US-0139817.
PR 22-JUN-1999; 99US-0139899.
PR 23-JUN-1999; 99US-0140353.
PR 23-JUN-1999; 99US-0140354.
PR 24-JUN-1999; 99US-0140695.
PR 28-JUN-1999; 99US-0140823.
PR 29-JUN-1999; 99US-0140991.
PR 30-JUN-1999; 99US-0141287.
PR 01-JUL-1999; 99US-0141842.
PR 01-JUL-1999; 99US-0142154.
PR 02-JUL-1999; 99US-0142055.
PR 06-JUL-1999; 99US-0142390.
PR 08-JUL-1999; 99US-0142803.
PR 09-JUL-1999; 99US-0142920.
PR 12-JUL-1999; 99US-0142977.
PR 13-JUL-1999; 99US-0143542.
PR 14-JUL-1999; 99US-0143624.
PR 15-JUL-1999; 99US-0144005.
PR 16-JUL-1999; 99US-0144085.
PR 16-JUL-1999; 99US-0144086.
PR 19-JUL-1999; 99US-0144325.



PR 19-JUL-1999; 99US-0144331.
PR 19-JUL-1999; 99US-0144332.
PR 19-JUL-1999; 99US-0144333.
PR 19-JUL-1999; 99US-0144334.
PR 19-JUL-1999; 99US-0144335.
PR 20-JUL-1999; 99US-0144352.
PR 20-JUL-1999; 99US-0144632.
PR 20-JUL-1999; 99US-0144884.
PR 21-JUL-1999; 99US-0144814.
PR 21-JUL-1999; 99US-0145086.
PR 21-JUL-1999; 99US-0145088.
PR 22-JUL-1999; 99US-0145085.
PR 22-JUL-1999; 99US-0145087.
PR 22-JUL-1999; 99US-0145089.
PR 22-JUL-1999; 99US-0145192.
PR 23-JUL-1999; 99US-0145145.
PR 23-JUL-1999; 99US-0145218.
PR 23-JUL-1999; 99US-0145224.
PR 26-JUL-1999; 99US-0145276.
PR 27-JUL-1999; 99US-0145913.
PR 27-JUL-1999; 99US-0145918.
PR 27-JUL-1999; 99US-0145919.
PR 28-JUL-1999; 99US-0145951.
PR 02-AUG-1999; 99US-0146386.
PR 02-AUG-1999; 99US-0146388.
PR 02-AUG-1999; 99US-0146389.
PR 03-AUG-1999; 99US-0147038.
PR 04-AUG-1999; 99US-0147204.
PR 04-AUG-1999; 99US-0147302.
PR 05-AUG-1999; 99US-0147192.
PR 05-AUG-1999; 99US-0147260.
PR 06-AUG-1999; 99US-0147303.
PR 06-AUG-1999; 99US-0147416.
PR 09-AUG-1999; 99US-0147493.
PR 09-AUG-1999; 99US-0147935.
PR 10-AUG-1999; 99US-0148171.
PR 11-AUG-1999; 99US-0148319.
PR 12-AUG-1999; 99US-0148341.
PR 13-AUG-1999; 99US-0148565.
PR 13-AUG-1999; 99US-0148684.
PR 16-AUG-1999; 99US-0149368.
PR 17-AUG-1999; 99US-0149175.
PR 18-AUG-1999; 99US-0149426.
PR 20-AUG-1999; 99US-0149722.
PR 20-AUG-1999; 99US-0149723.
PR 20-AUG-1999; 99US-0149929.
PR 23-AUG-1999; 99US-0149902.
PR 23-AUG-1999; 99US-0149930.
PR 25-AUG-1999; 99US-0150566.
PR 26-AUG-1999; 99US-0150884.
PR 27-AUG-1999; 99US-0151065.
PR 27-AUG-1999; 99US-0151066.
PR 27-AUG-1999; 99US-0151080.
PR 30-AUG-1999; 99US-0151303.
PR 31-AUG-1999; 99US-0151438.
PR 01-SEP-1999; 99US-0151930.
PR 07-SEP-1999; 99US-0152363.



PR 10-SEP-1999; 99US-0153070.
PR 13-SEP-1999; 99US-0153758.
PR 15-SEP-1999; 99US-0154018.
PR 16-SEP-1999; 99US-0154039.
PR 20-SEP-1999; 99US-0154779.
PR 22-SEP-1999; 99US-0155139.
PR 23-SEP-1999; 99US-0155486.
PR 24-SEP-1999; 99US-0155659.
PR 28-SEP-1999; 99US-0156458.
PR 29-SEP-1999; 99US-0156596.
PR 04-OCT-1999; 99US-0157117.
PR 05-OCT-1999; 99US-0157753.
PR 06-OCT-1999; 99US-0157865.
PR 07-OCT-1999; 99US-0158029.
PR 08-OCT-1999; 99US-0158232.
PR 12-OCT-1999; 99US-0158369.
PR 13-OCT-1999; 99US-0159293.
PR 13-OCT-1999; 99US-0159294.
PR 13-OCT-1999; 99US-0159295.
PR 14-OCT-1999; 99US-0159329.
PR 14-OCT-1999; 99US-0159330.
PR 14-OCT-1999; 99US-0159331.
PR 14-OCT-1999; 99US-0159637.
PR 14-OCT-1999; 99US-0159638.
PR 18-OCT-1999; 99US-0159584.
PR 21-OCT-1999; 99US-0160741.
PR 21-OCT-1999; 99US-0160767.
PR 21-OCT-1999; 99US-0160768.
PR 21-OCT-1999; 99US-0160770.
PR 21-OCT-1999; 99US-0160814.
PR 21-OCT-1999; 99US-0160815.
PR 22-OCT-1999; 99US-0160980.
PR 22-OCT-1999; 99US-0160981.
PR 22-OCT-1999; 99US-0160989.
PR 25-OCT-1999; 99US-0161404.
PR 25-OCT-1999; 99US-0161405.
PR 25-OCT-1999; 99US-0161406.
PR 26-OCT-1999; 99US-0161359.
PR 26-OCT-1999; 99US-0161360.
PR 26-OCT-1999; 99US-0161361.
PR 28-OCT-1999; 99US-0161920.
PR 28-OCT-1999; 99US-0161992.
PR 28-OCT-1999; 99US-0161993.
PR 29-OCT-1999; 99US-0162142.



Query Match 16.9%; Score 71; DB 21; Length 1028; 

Best Local Similarity 63.6%; Pred. No. 4.9e-10; 

Matches 124; Conservative 0; Mismatches 70; Indels 1; Gaps 1; 

Qy 222 ggccgaaccggtgcagcgcctgcaggaagagggttggacttacgggattcaactgccggt 281
       ||||||| |||||||   | || |||||  |||| ||  | || |||||||| |||||||
Db 461 ggccgaatcggtgcacgacgtgtaggaaacgggtcgggttaaccggattcaagtgccggt 520

Qy 282 gtgggaacttgtactgcgcactccaccgctactccgacaagcacgactgcaagttcgact 341
         || |   |||  ||||   | ||  | ||| | || |  || |  ||||  | ||| |
Db 521 tcggtacgatgttttgcggggttcataggtacccggagatccatggatgcagctacgatt 580

Qy 342 aacggactgctgccaaggacgccattgccaagggctaatccggtggtgaaggcagacaag 401
             | || |     || |  || || ||  || |||||| ||||||| ||||  |||
Db 581 tcaaatcggccggacgtgaagagatcgcgaa-agcgaatccgttggtgaaagcagcgaag 639

Qy 402 ctcgacaagatctaa 416
       ||  | ||||| | |
Db 640 cttcagaagatatga 654



RESULT 11
AAC35523
ID AAC35523 standard; DNA; 830 BP.
XX
AC AAC35523;
XX
DT 17-OCT-2000 (first entry)
XX
DE Arabidopsis thaliana DNA fragment SEQ ID NO: 10502.
XX
KW Hybridisation assay; genetic mapping; gene expression control;
KW protein identification; signal transduction pathway;
KW metabolic pathway; promoter; termination sequence; ss.
XX
OS Arabidopsis thaliana.
XX
PN EP1033405-A2.
XX
PD 06-SEP-2000.
XX
PF 25-FEB-2000; 2000EP-0301439.
XX
PR 25-FEB-1999; 99US-0121825.
PR 05-MAR-1999; 99US-0123180.
PR 09-MAR-1999; 99US-0123548.
PR 23-MAR-1999; 99US-0125788.
PR 25-MAR-1999; 99US-0126264.
PR 29-MAR-1999; 99US-0126785.
PR 01-APR-1999; 99US-0127462.
PR 06-APR-1999; 99US-0128234.
PR 08-APR-1999; 99US-0128714.
PR 16-APR-1999; 99US-0129845.
PR 19-APR-1999; 99US-0130077.
PR 21-APR-1999; 99US-0130449.
PR 23-APR-1999; 99US-0130510.
PR 23-APR-1999; 99US-0130891.
PR 28-APR-1999; 99US-0131449.
PR 30-APR-1999; 99US-0132048.
PR 30-APR-1999; 99US-0132407.
PR 04-MAY-1999; 99US-0132484.
PR 05-MAY-1999; 99US-0132485.
PR 06-MAY-1999; 99US-0132486.
PR 06-MAY-1999; 99US-0132487.
PR 07-MAY-1999; 99US-0132863.
PR 11-MAY-1999; 99US-0134256.
PR 14-MAY-1999; 99US-0134218.
PR 14-MAY-1999; 99US-0134219.
PR 14-MAY-1999; 99US-0134221.
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- JUN- 
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PR 
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1 AAA 
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PR 
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- JUN- 
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PR 


18 


- JUN- 


1 A A A 
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PR 


18 


-JUN- 


1999 


PR 


T O 
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PR 
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- JUN- 
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PR 
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- JUN- 
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PR 
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- JUN- 
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1999 
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18 


- JUN- 
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- JUN- 


1999 


PR 
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- JUN- 
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1 AAA 

1999 
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1 AAA 

1999 


PR 
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1 AAA 

1999 


PR 


23 


- JUN- 


1999 


PR 
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- JUN- 


1999 


PR 
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- JUN- 


1999 


PR 


28 


- JUN- 


1999 


PR 


29 


- JUN- 
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PR 


30 


-JUN- 
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PR 


01 


- JUL- 


1999 


PR 


01 


- JUL- 


1999 


PR 


02 


- JUL- 


1999 


PR 


06 


- JUL- 


1999 


PR 


08 


- JUL- 


1999 


PR 


09 


- JUL- 


1999 


PR 


12 


- JUL- 


1999 


PR 


13 


-JUL- 
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PR 


14 


- JUL- 
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PR 


15 


- JUL- 
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PR 


16 


- JUL- 
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PR 
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-JUL- 


•1999 


PR 


19 


-JUL- 
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PR 


19 


-JUL- 


■1999 


PR 


19 


-JUL- 


-1999 



99US-0134370 
99US-0134768 
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Query Match 16.4%; Score 69.2; DB 21; Length 830; 

Best Local Similarity 63.7%; Pred. No. 1.4e-09; 

Matches 121; Conservative 0; Mismatches 68; Indels 1; Gaps 1; 

Qy 227 aaccggtgcagcgcctgcaggaagagggttggacttacgggattcaactgccggtgtggg 286
       ||||| ||    |  ||  ||||||  |||||  | || |||||||  ||||| || ||
Db 394 aaccgatgttctggttgtcggaagaaagttgggttgaccggattcaggtgccgatgcggt 453

Qy 287 aacttgtactgcgcactccaccgctactccgacaagcacgactgcaagttcgactaacgg 346
        |  | | |||| ||   ||||| ||||||||    || || || |  | ||||||
Db 454 gagcttttctgctcagagcaccgttactccgatcgtcatgattgtagctacgactacaaa 513

Qy 347 actgctgccaaggacgccattgccaagggctaatccggtggtgaaggcagacaagctcga 406
       || ||||     || || || | | || |  ||||||||||| ||||| |  ||  | |
Db 514 accgctggtcgtgaggcgatcg-ctagagaaaatccggtggtcaaggctgcgaaaatggt 572

Qy 407 caagatctaa 416
       |||  | |||
Db 573 caaagtttaa 582



RESULT 12 
AAC57020 

ID AAC57020 standard; DNA; 348 BP. 
XX 

AC AAC57020; 
XX 

DT 25-JAN-2001 (first entry) 
XX 

DE Pinus radiata transcription factor DNA sequence #466. 
XX 

KW Plant; transcription factor; gene expression; eucalyptus; pine; acacia;
KW poplar; sweetgum; teak; mahogany; bZIP; G-box binding factor;
KW basic helix-loop-helix zipper; homeotic; homeo domain; homeobox; MADS;
KW homeodomain zipper; LIM domain; AP2; EREBs; zinc finger domain;
KW type 2 Cys2His2; CCAAT box element; MYB; ss.

XX 

OS Pinus radiata. 
XX 

PN WO200053724-A2.
XX
PD 14-SEP-2000.
XX
PF 09-MAR-2000; 2000WO-US06112.
XX
PR 11-MAR-1999; 99US-0266513.
PR 18-AUG-1999; 99US-0149485.
XX
PA (GENE-) GENESIS RES & DEV CORP LTD.
PA (FLET-) FLETCHER CHALLENGE FORESTS LTD.

XX 

PI Wood M, McGrath A, Shenk MA, Glenn M; 
XX 

DR WPI; 2000-579369/54. 
XX 

PT New isolated polynucleotide encoding a plant transcription factor for 

PT producing a plant e.g. a woody plant, preferably eucalyptus or pine, 

PT having modified gene expression or modified activity of a polypeptide 
PT 
XX 

PS Claim 1; Pages 567-568; 747pp; English. 
XX 

CC The present invention relates to novel plant transcription factors from 

CC Eucalyptus grandis or Pinus radiata. The present sequence is the coding 

CC sequence for one such transcription factor. The transcription factor may 

CC be used to produce a plant having modified gene expression such as a 

CC woody plant e.g. a eucalyptus, pine, acacia, poplar, sweetgum, teak, or 

CC mahogany species or to modify the activity of a polypeptide in a plant. 

CC The transcription factors of the present invention are members from the 

CC following families of regulatory proteins: bZIP, bZIP family of G-box 

CC binding factors, basic helix-loop-helix zipper, 

CC homeotic/homeodomain/homeobox/MADS, homeodomain zipper, LIM domain, AP2 

CC and EREBs, zinc finger domains of type 2 Cys2His2, CCAAT box elements 



CC and MYB.
XX 

SQ Sequence 348 BP; 81 A; 92 C; 107 G; 68 T; 0 other; 



Query Match 16.2%; Score 68; DB 21; Length 348;
Best Local Similarity 63.0%; Pred. No. 2.4e-09;
Matches 104; Conservative 0; Mismatches 61; Indels 0; Gaps 0;

Qy 207 aaggggggaaggtanggccgaaccggtgcagcgcctgcaggaagagggttggacttacgg 266
       ||  |  || |    |   |||  ||||    || || |  | | |||| || || ||||
Db 184 aacagctgaggaagagcaagaagaggtgttttgcgtgtaacaggcgggtggggctgacgg 243

Qy 267 gattcaactgccggtgtgggaacttgtactgcgcactccaccgctactccgacaagcacg 326
       | || || ||||| |||||  || | | |||||| |  ||| | ||||| || | ||| |
Db 244 gctttaagtgccgctgtggtgaccttttctgcgctcagcacaggtactctgatatgcatg 303

Qy 327 actgcaagttcgactaacggactgctgccaaggacgccattgcca 371
       |||||   || |||||   |||||| | |     |||||||  ||
Db 304 actgctcttttgactacaagactgccggccgcctcgccattctca 348



RESULT 13
AAC40557
ID AAC40557 standard; DNA; 862 BP.
XX
AC AAC40557;
XX
DT 17-OCT-2000 (first entry)
XX
DE Arabidopsis thaliana DNA fragment SEQ ID NO: 28736.
XX
KW Hybridisation assay; genetic mapping; gene expression control;
KW protein identification; signal transduction pathway;
KW metabolic pathway; promoter; termination sequence; ss.
XX
OS Arabidopsis thaliana.
XX
PN EP1033405-A2.
XX
PD 06-SEP-2000.
XX
PF 25-FEB-2000; 2000EP-0301439.
XX
PR 25-FEB-1999; 99US-0121825.
PR 05-MAR-1999; 99US-0123180.
PR 09-MAR-1999; 99US-0123548.
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23-MAR-1999, 


99US-0125788. 
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01-APR-1999, 


99US-0127462. 


PR 


06-APR-1999 
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Query Match 15.9%; Score 67; DB 21; Length 862; 

Best Local Similarity 62.8%; Pred. No. 5.7e-09; 

Matches 120; Conservative 0; Mismatches 70; Indels 1; Gaps 
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Qy 231 ggtgcagcgcctgcaggaagagggttggacttacgggattcaactgccggtgtgggaact 2 90 

I I I I I I I I MM M M I I I M M M M I M M M I M I I 

Db 476 gatgcttgagctgtaacaagaaagttggcgtgacgggtttcaagtgtaggtgtggaagca 535

Qy 291 tgtactgcgcactccaccgctactccgacaagcacgactgcaagttcgactaacggactg 350

I M I I I M I M MM M M M M M M M I I 
Db 536 ctttctgtggaactcatagatatccagagagtcacgagtgccaattcgatttcaaaggag 595 

Qy 351 ctgccaaggacgccattgccaagggctaatccggtggtgaaggcagacaagctcgacaag 410 

MM M M M M M I M M M M M M M M I M M M I M M I 
Db 596 ttgcgagagaagctattgctaa-agctaatcctgtggtgaaggctgataaggttgatagg 654 

Qy 411 atctaaggggg 421 

Mil II 
Db 655 atatgaattgg 665 
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99US- 


0160741 . 


PR 21-OCT-1999; 99US-0160767.
PR 21-OCT-1999; 99US-0160768.
PR 21-OCT-1999; 99US-0160770.
PR 21-OCT-1999; 99US-0160814.
PR 21-OCT-1999; 99US-0160815.
PR 22-OCT-1999; 99US-0160980.
PR 22-OCT-1999; 99US-0160981.
PR 22-OCT-1999; 99US-0160989.
PR 25-OCT-1999; 99US-0161404.
PR 25-OCT-1999; 99US-0161405.
PR 25-OCT-1999; 99US-0161406.
PR 26-OCT-1999; 99US-0161359.
PR 26-OCT-1999; 99US-0161360.
PR 26-OCT-1999; 99US-0161361.
PR 28-OCT-1999; 99US-0161920.
PR 28-OCT-1999; 99US-0161992.
PR 28-OCT-1999; 99US-0161993.
PR 29-OCT-1999; 99US-0162142.



Query Match 15.9%; Score 66.8; DB 21; Length 594;

Best Local Similarity 63.4%; Pred. No. 5.8e-09; 

Matches 118; Conservative 0; Mismatches 67; Indels 1; Gaps 1; 

Qy 231 ggtgcagcgcctgcaggaagagggttggacttacgggattcaactgccggtgtgggaact 290 

I I I I III I I I I I I I I I I I I I I I I I I I I I ! I I I I I I 1 I I I 

Db 410 gatgcttgagctgtaacaagaaagttggcgtgacgggtttcaagtgtaggtgtggaagca 469



Qy 291 tgtactgcgcactccaccgctactccgacaagcacgactgcaagttcgactaacggactg 350 

I I I I I I II III I I I I I I I I I I I I I I I I I I I I 
Db 470 ctttctgtggaactcatagatatccagagagtcacgagtgccaattcgatttcaaaggag 529 

Qy 351 ctgccaaggacgccattgccaagggctaatccggtggtgaaggcagacaagctcgacaag 410 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 530 ttgcgagagaagctattgctaa-agctaatcctgtggtgaaggctgataaggttgatagg 588 

Qy 411 atctaa 416 

I I I I 
Db 589 atatga 594 



RESULT 15 
AAH22181 

ID AAH22181 standard; cDNA; 1425 BP. 
XX 

AC AAH22181; 
XX 

DT 20-AUG-2001 (first entry) 
XX 

DE Human zinc finger protein (hZNF216) nucleotide sequence. 
XX 

KW Human; hZNF216-iso; hZNF216; zinc finger protein; ss. 
XX 

OS Homo sapiens . 
XX 

PN CN1281898-A. 
XX 

PD 31-JAN-2001. 
XX 

PF 27-JUN-2000; 2000CN-0116798.
XX

PR 27-JUN-2000; 2000CN-0116798.
XX 

PA (NANF-) NAN FANG RES CENT STATE HUMAN GENE GROUP. 
XX 

PI Li N, Xu X, Xiao H; 
XX 

DR WPI; 2001-282784/30. 

DR P-PSDB; AAB98228 . 
XX 

PT New human zinc finger protein isomer, hZNF216, polypeptide and 

PT polynucleotide useful for detecting hZNF216 

XX 

PS Example 2; Fig 2; 23pp; Chinese. 
XX 

CC The present invention describes a human zinc finger protein isomer, 

CC designated hZNF216-iso. Also described are methods for the preparation 

CC and detection of hZNF216-iso protein and nucleotide sequences. The 

CC present sequence represents the human zinc finger protein hZNF216 

CC nucleotide sequence which is given in comparison with the hZNF216-iso 

CC nucleotide sequence in an example from the present invention. 

XX 

SQ Sequence 1425 BP; 464 A; 279 C; 300 G; 382 T; 0 other; 



Query Match 15.3%; Score 64.4; DB 22; Length 1425;

Best Local Similarity 62.1%; Pred. No. 3.3e-08;

Matches 118; Conservative 0; Mismatches 71; Indels 1; Gaps 1;



Qy 227
Db 492
Qy 287
Db 552
Qy 347
Db 612
Qy 407
Db 671



I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 



I I I I I I I I I • I I I I I I I I I I I I I I I I I I I M I II I I II MM 



II II II II I I I I I I M I I III 



I I I II I 



Search completed: February 7, 2002, 11:00:03 
Job time: 4989 sec 

GenCore version 4.5 
Copyright (c) 1993 - 2000 Compugen Ltd. 

OM nucleic - nucleic search, using sw model 

Run on: February 7, 2002, 10:51:52 ; Search time 172.96 Seconds 

(without alignments) 
551.268 Million cell updates/sec 

Title: US-09-394-745-6489

Perfect score: 421 

Sequence: 1 ggggccagcacgcgtccggc ctcgacaagatctaaggggg 421 

Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 351203 seqs, 113238999 residues 

Total number of hits satisfying chosen parameters: 702406 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 
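
A rough consistency check on the scoring parameters above. The report names only the scoring table (IDENTITY_NUC) and the gap settings; the per-base values used here (+1 for a match, -0.6 for a mismatch) and the gap cost of Gapop + Gapext per gap base are not stated anywhere in this listing. They are inferred from the reported figures and should be treated as assumptions. The function name is hypothetical.

```python
def alignment_score(matches, mismatches, gap_lengths,
                    match=1.0, mismatch=-0.6, gapop=10.0, gapext=1.0):
    """Tally a local-alignment score from its match/mismatch/gap counts.

    match and mismatch are ASSUMED values (not printed in the report);
    gapop and gapext are the Gapop/Gapext settings from the report header.
    Each gap of n bases is assumed to cost gapop + gapext * n.
    """
    gap_cost = sum(gapop + gapext * n for n in gap_lengths)
    return matches * match + mismatches * mismatch - gap_cost


# 116 matches, 74 mismatches, one single-base gap
print(round(alignment_score(116, 74, [1]), 1))
```

Under these assumptions, 116 matches, 74 mismatches and one single-base gap give 60.6, which agrees with the score printed for the first hit in the ALIGNMENTS section. Alignments that span the ambiguous base n in the query may not fit this simple tally.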

Database : Issued_Patents_NA:*

1: /cgn2_6/ptodata/2/ina/5A_COMB.seq:*
2: /cgn2_6/ptodata/2/ina/5B_COMB.seq:*
3: /cgn2_6/ptodata/2/ina/6A_COMB.seq:*
4: /cgn2_6/ptodata/2/ina/6B_COMB.seq:*
5: /cgn2_6/ptodata/2/ina/PCTUS_COMB.seq:*
6: /cgn2_6/ptodata/2/ina/backfiles1.seq:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
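
The two percentages printed with each alignment in this listing appear to follow directly from the other reported fields: Query Match is the score as a share of the perfect score (421 for this query), and Best Local Similarity is the share of matching positions among all aligned columns. This reading is inferred from the numbers in this report rather than taken from vendor documentation, and the function names are hypothetical.

```python
def query_match_pct(score, perfect_score):
    """Score expressed as a percentage of the query's perfect score."""
    return 100.0 * score / perfect_score


def best_local_similarity_pct(matches, conservative, mismatches, indels):
    """Matching positions as a percentage of all aligned columns.

    The denominator (matches + conservative + mismatches + indels) is an
    assumption that reproduces the figures printed in this report.
    """
    aligned = matches + conservative + mismatches + indels
    return 100.0 * matches / aligned


print(round(query_match_pct(60.6, 421), 1))
print(round(best_local_similarity_pct(116, 0, 74, 1), 1))
```

For example, Score 60.6 against a perfect score of 421 gives 14.4%, and Matches 116, Mismatches 74, Indels 1 gives 60.7%.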

SUMMARIES

               %
Result       Query
  No.  Score Match   Length DB ID                 Description

    1   60.6  14.4     1197  1 US-08-861-269-4    Sequence 4, Appli
    2   60.6  14.4     1197  2 US-09-134-596-4    Sequence 4, Appli
    3   60.6  14.4     1197  3 US-09-293-273-4    Sequence 4, Appli
c   4   35.8   8.5     7218  1 US-08-232-463-14   Sequence 14, Appl
    5   35.4   8.4  4411529  4 US-09-103-840A-1   Sequence 1, Appli
    6   34.6   8.2      459  2 US-08-387-942C-37  Sequence 37, Appl
    7   34.6   8.2    12588  2 US-08-387-942C-1   Sequence 1, Appli
    8   34.4   8.2      750  3 US-09-167-717-3    Sequence 3, Appli
    9   33.2   7.9    80161  3 US-09-036-987A-1   Sequence 1, Appli
   10   33.2   7.9    80161  4 US-09-370-700-1    Sequence 1, Appli
   11   33.2   7.9  4403765  4 US-09-103-840A-2   Sequence 2, Appli
   12     33   7.8     2899  4 US-08-981-392-24   Sequence 24, Appl
   13   32.4   7.7     4601  1 US-08-470-058-3    Sequence 3, Appli
   14   32.4   7.7     4601  3 US-09-037-188-3    Sequence 3, Appli
   15   32.4   7.7     4601  4 US-09-285-310-3    Sequence 3, Appli
   16   32.4   7.7     5095  1 US-08-092-817-3    Sequence 3, Appli
c  17   32.4   7.7    50937  4 US-09-428-517-1    Sequence 1, Appli
c  18   32.2   7.6     1431  1 US-08-254-357-1    Sequence 1, Appli
c  19   32.2   7.6     4983  1 US-08-472-358-1    Sequence 1, Appli
c  20   32.2   7.6     4983  5 PCT-US92-05786A-1  Sequence 1, Appli
c  21   31.8   7.6     2015  1 US-07-937-609-25   Sequence 25, Appl
c  22   31.8   7.6     2015  4 US-08-029-170-25   Sequence 25, Appl
c  23   31.6   7.5     1950  3 US-08-911-853-14   Sequence 14, Appl
c  24   31.6   7.5     1950  4 US-09-479-409-14   Sequence 14, Appl
   25   31.6   7.5     5247  1 US-08-920-812-15   Sequence 15, Appl
   26   31.6   7.5     5247  1 US-08-920-827-15   Sequence 15, Appl
   27   31.6   7.5     5247  1 US-08-921-177-15   Sequence 15, Appl
   28   31.6   7.5     5247  1 US-08-362-577C-15  Sequence 15, Appl
   29   31.6   7.5     5247  2 US-08-920-828-15   Sequence 15, Appl
   30   31.6   7.5    17612  3 US-08-911-853-29   Sequence 29, Appl
   31   31.6   7.5    17612  4 US-09-479-409-29   Sequence 29, Appl
c  32   31.6   7.5  4403765  4 US-09-103-840A-2   Sequence 2, Appli
c  33     31   7.4     1195  1 US-08-117-373-5    Sequence 5, Appli
c  34     31   7.4     3914  1 US-08-117-373-11   Sequence 11, Appl
   35     31   7.4    11220  4 US-09-105-537-32   Sequence 32, Appl
   36     31   7.4    36778  4 US-09-105-537-5    Sequence 5, Appli
   37     31   7.4    38506  3 US-09-320-878-19   Sequence 19, Appl
c  38     31   7.4    44377  2 US-08-804-227C-7   Sequence 7, Appli
c  39     31   7.4    44377  2 US-08-804-198-1    Sequence 1, Appli
c  40   30.8   7.3     8201  1 US-08-253-155A-9   Sequence 9, Appli
c  41   30.6   7.3     1182  1 US-08-241-943-23   Sequence 23, Appl
c  42   30.6   7.3     2327  6 5229279-5          Patent No 5229279
c  43   30.6   7.3     2327  6 5512669-5          Patent No 5512669
c  44   30.6   7.3     4984  1 US-08-687-806-1    Sequence 1, Appli
c  45   30.6   7.3     5392  2 US-08-403-852D-1   Sequence 1, Appli
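
A minimal sketch of how one row of the summary table above could be split into fields. The field names and the function are hypothetical. Reading the leading "c" as a complement-strand flag is an assumption, suggested by the "/C" suffix and the descending Db coordinates shown for RESULT 4.

```python
import re
from typing import NamedTuple, Optional


class SummaryRow(NamedTuple):
    complement: bool    # leading "c" flag (assumed to mean complement strand)
    result_no: int
    score: float
    query_match: float  # percent of perfect score
    length: int
    db: int
    seq_id: str
    description: str


# Optional "c", then No., Score, % Match, Length, DB, ID, free-text description.
_ROW = re.compile(
    r"^\s*(c\s+)?(\d+)\s+(\d+(?:\.\d+)?)\s+(\d+(?:\.\d+)?)"
    r"\s+(\d+)\s+(\d+)\s+(\S+)\s+(.*\S)\s*$"
)


def parse_summary_row(line: str) -> Optional[SummaryRow]:
    """Return the parsed row, or None if the line is not a summary row."""
    m = _ROW.match(line)
    if m is None:
        return None
    flag, no, score, match, length, db, seq_id, desc = m.groups()
    return SummaryRow(flag is not None, int(no), float(score), float(match),
                      int(length), int(db), seq_id, desc)


row = parse_summary_row("c 44 30.6 7.3 4984 1 US-08-687-806-1 Sequence 1, Appli")
print(row)
```

Lines that do not fit the row layout, such as section headings, return None, so the function can be applied to every line of the section.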



ALIGNMENTS 



RESULT 1 
US-08-861-269-4 

Sequence 4, Application US/08861269 
Patent No. 5817494 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Corley, Neil C. 
APPLICANT: Lai, Preeti 
APPLICANT: Shah, Purvi 

TITLE OF INVENTION: UBIQUITIN CONJUGATION PROTEINS 
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE: CA 
COUNTRY: USA 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/861,269 
FILING DATE: Filed Herewith 
PRIOR APPLICATION DATA: 
APPLICATION NUMBER: 
FILING DATE: 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749
REFERENCE/DOCKET NUMBER: PF-0302 US
TELECOMMUNICATION INFORMATION:
TELEPHONE: 415-855-0555
TELEFAX: 415-845-4166
INFORMATION FOR SEQ ID NO: 4:
SEQUENCE CHARACTERISTICS:
LENGTH: 1197 base pairs
TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear 
IMMEDIATE SOURCE: 

LIBRARY: BRAINOT09 
CLONE: 2151473 
US-08-861-269-4 



Query Match 14.4%; Score 60.6; DB 1; Length 1197;

Best Local Similarity 60.7%; Pred. No. 3.8e-08;

Matches 116; Conservative 0; Mismatches 74; Indels 1; Gaps 1;



Qy 226 gaaccggtgcagcgcctgcaggaagagggttggacttacgggattcaactgccggtgtgg 285 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I II I I 
Db 699 GAATCGCTGTTTCATGTGCAGGAAGAAAGTGGGACTTACTGGGTTTGAATGCCGGTGTGG 758 

Qy 286 gaacttgtactgcgcactccaccgctactccgacaagcacgactgcaagttcgactaacg 345

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I
Db 759 AAATGTTTACTGTGGTGTACACCGTTACTCAGATGTACACAATTGCTCTTACAATTACAA 818

Qy 346 gactgctgccaaggacgccattgccaagggctaatccggtggtgaaggcagacaagctcg 405

I I I I I III II II I II I I I I I I I I I I I II I I
Db 819 AGCCGATGCTGCTGA-GAAAATCAGAAAAGAAAATCCAGTAGTTGTTGGTGAAAAGATCC 877

Qy 406 acaagatctaa 416

I I I I I I I I
Db 878 AAAAGATTTGA 888



RESULT 2 
US-09-134-596-4 

Sequence 4, Application US/09134596 
Patent No. 5922318 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Corley, Neil C. 
APPLICANT: Lai, Preeti 
APPLICANT: Shah, Purvi 

TITLE OF INVENTION: UBIQUITIN CONJUGATION PROTEINS 
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE: CA 
COUNTRY: USA 
ZIP : 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/134,596 
FILING DATE: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/861,269 
FILING DATE: 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0302 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166
INFORMATION FOR SEQ ID NO: 4: 
SEQUENCE CHARACTERISTICS: 



LENGTH: 1197 base pairs 
TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
IMMEDIATE SOURCE: 

LIBRARY: BRAINOT09 
CLONE: 2151473 
US-09-134-596-4 



Query Match 14.4%; Score 60.6; DB 2; Length 1197; 

Best Local Similarity 60.7%; Pred. No. 3.8e-08; 



Matches 116; Conservative 0; Mismatches 74; Indels 1; Gaps 1;

Qy 226 gaaccggtgcagcgcctgcaggaagagggttggacttacgggattcaactgccggtgtgg 285

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 699 GAATCGCTGTTTCATGTGCAGGAAGAAAGTGGGACTTACTGGGTTTGAATGCCGGTGTGG 758

Qy 286 gaacttgtactgcgcactccaccgctactccgacaagcacgactgcaagttcgactaacg 345

II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 Ml 1 1 1 1 1
Db 759 AAATGTTTACTGTGGTGTACACCGTTACTCAGATGTACACAATTGCTCTTACAATTACAA 818

Qy 346 gactgctgccaaggacgccattgccaagggctaatccggtggtgaaggcagacaagctcg 405

Mill III II II 1 1 1 1 II 1 1 II 1 1 1 1 1 1 M
Db 819 AGCCGATGCTGCTGA-GAAAATCAGAAAAGAAAATCCAGTAGTTGTTGGTGAAAAGATCC 877

Qy 406 acaagatctaa 416

1 1 1 1 1 1 i 1
Db 878 AAAAGATTTGA 888


RESULT 3 
US-09-293-273-4 

Sequence 4, Application US/09293273 
Patent No. 6057112 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Corley, Neil C. 
APPLICANT: Lai, Preeti 
APPLICANT: Shah, Purvi 

TITLE OF INVENTION: UBIQUITIN CONJUGATION PROTEINS 
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE: CA 
COUNTRY: USA 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/293,273 
FILING DATE: 



PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 09/134,596 

FILING DATE: 
ATTORNEY/AGENT INFORMATION: 
; NAME: Billings, Lucy J. 

REGISTRATION NUMBER: 36,749 

REFERENCE/DOCKET NUMBER: PF-0302 US 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-855-0555 

TELEFAX: 415-845-4166 
; INFORMATION FOR SEQ ID NO: 4:
SEQUENCE CHARACTERISTICS: 

LENGTH: 1197 base pairs 
; TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 
IMMEDIATE SOURCE: 

LIBRARY: BRAINOT09 

CLONE: 2151473 
US-09-293-273-4 



Query Match 14.4%; Score 60.6; DB 3; Length 1197; 

Best Local Similarity 60.7%; Pred. No. 3.8e-08;

Matches 116; Conservative 0; Mismatches 74; Indels 1; Gaps 1; 

Qy 226 gaaccggtgcagcgcctgcaggaagagggttggacttacgggattcaactgccggtgtgg 285 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 699 GAATCGCTGTTTCATGTGCAGGAAGAAAGTGGGACTTACTGGGTTTGAATGCCGGTGTGG 758 

Qy 286 gaacttgtactgcgcactccaccgctactccgacaagcacgactgcaagttcgactaacg 345 

II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 759 AAATGTTTACTGTGGTGTACACCGTTACTCAGATGTACACAATTGCTCTTACAATTACAA 818

Qy 346 gactgctgccaaggacgccattgccaagggctaatccggtggtgaaggcagacaagctcg 405 

I I I I I Ml II II I I I I I I I I I I I II I M II 
Db 819 AGCCGATGCTGCTGA-GAAAATCAGAAAAGAAAATCCAGTAGTTGTTGGTGAAAAGATCC 877 

Qy 406 acaagatctaa 416 

I I I I I I I I 
Db 878 AAAAGATTTGA 888



RESULT 4 
US-08-232-463-14/C 

; Sequence 14, Application US/08232463 

; Patent No. 5670367 

; GENERAL INFORMATION: 

APPLICANT: DORNER, F.

APPLICANT: SCHEIFLINGER, F. 

APPLICANT: FALKNER, F. G. 

TITLE OF INVENTION: RECOMBINANT FOWLPOX VIRUS 
NUMBER OF SEQUENCES: 52 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Foley & Lardner 

STREET: 1800 Diagonal Road, Suite 500 

CITY: Alexandria 



STATE : VA 
COUNTRY: USA 
ZIP: 22313-0299 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/232,463 
FILING DATE: 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US/07/935,313 
FILING DATE: 

APPLICATION NUMBER: EP 91 114 300.6 
FILING DATE: 26-AUG-1991 
ATTORNEY/AGENT INFORMATION: 
NAME: BENT, Stephen A. 
REGISTRATION NUMBER: 29,768 
REFERENCE/DOCKET NUMBER: 30472/114 IMMU 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (703) 836-9300 
TELEFAX: (703) 683-4109 
TELEX: 899149 
INFORMATION FOR SEQ ID NO: 14: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 7218 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
IMMEDIATE SOURCE: 
CLONE: pTZgpt-Fls 
US-08-232-463-14 

Query Match 8.5%; Score 35.8; DB 1; Length 7218; 

Best Local Similarity 1.9%; Pred. No. 0.69; 

Matches 7; Conservative 203; Mismatches 156; Indels 0; Gaps 0; 

Qy 56 agcatcgtgaacggcagcgacgccgtcatggagccggttgttgctggcagcaacacggta 115

| | | | |||::::::: : : : : : : : : : : ::::::::::
Db 1446 AGAATTTGGTACRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 1387

Qy 116 gtagctgttgcccaagttgagttgcaaacaatgaacgtgcagcagcccgctgatgttgcc 175

Db 1386 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 1327

Qy 176 ggacccagcgagggggtggcggcgatctccaaaggggggaaggtanggccgaaccggtgc 235

Db 1326 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 1267

Qy 236 agcgcctgcaggaagagggttggacttacgggattcaactgccggtgtgggaacttgtac 295

Db 1266 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 1207

Qy 296 tgcgcactccaccgctactccgacaagcacgactgcaagttcgactaacggactgctgcc 355

Db 1206 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 1147

Qy 356 aaggacgccattgccaagggctaatccggtggtgaaggcagacaagctcgacaagatcta 415

Db 1146 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 1087

Qy 416 aggggg 421

Db 1086 RRRRRR 1081



RESULT 5 
US-09-103-840A-1 

; Sequence 1, Application US/09103840A 

; Patent No. 6294328 

; GENERAL INFORMATION: 

; APPLICANT: FLEISCHMAN, Robert D. 

; APPLICANT: WHITE, Owen R. 

; APPLICANT: FRASER, Claire M. 

; APPLICANT: VENTER, John C. 

; TITLE OF INVENTION: DNA SEQUENCES FOR STRAIN ANALYSIS IN MYCOBACTERIUM 

; TITLE OF INVENTION: TUBERCULOSIS 

; FILE REFERENCE: 24366-20007.00 

; CURRENT APPLICATION NUMBER: US/09/103,840A

; CURRENT FILING DATE: 1998-06-24

; NUMBER OF SEQ ID NOS: 2

; SOFTWARE: PatentIn Ver. 2.1

; SEQ ID NO 1 

LENGTH: 4411529 

TYPE: DNA 

ORGANISM: Mycobacterium tuberculosis 
OTHER INFORMATION: H37Rv 
US-09-103-840A-1 



Query Match 8.4%; Score 35.4; DB 4; Length 4411529; 

Best Local Similarity 48.8%; Pred. No. 8.1; 

Matches 123; Conservative 0; Mismatches 127; Indels 2; Gaps 1; 

Qy 8 gcacgcgtccggcaggatcaggccaagctggctgcctcctctatcgacagcatcgtgaac 67 

II I I I I I I I Ml I III I I I I I Ml III I 

Db 338607 gccgccggccccgccgttgacgccggccgcgccggatcctccggcgccgccgttgccgat 338666

Qy 68 ggcagcgacgccgtcatggagccggttgttgctggcagcaacacggtagtagctgttgcc 127 

I I I I I I I I I I I II I I I I I I I MINIM 

Db 338667 taaccagccgccgtccccgccattggccccggtgccgggggcgccgttggtgccgttgcc 338726

Qy 128 caagttgagttgcaaacaatgaacgtgcagcagcccgctgatgttgccggacccagcgag 187 

I III II I I I I I I I I If I III Mill 

Db 338727 gatcagcgggcgcccggtattcgccaggaagaactcgttgatcggggcg--agcagcggc 338784 

Qy 188 ggggtggcggcgatctccaaaggggggaaggtanggccgaaccggtgcagcgcctgcagg 247

I I I I II I II I I Ml I I I I I I III I M I I I I I I I I I 

Db 338785 gaggtggcggcggcctcggcggcggcgtacgcgccgccaccggaggtcaacgcctgcacg 338844



Qy 248 aagagggttgga 259 



Db 338845 aactgggcatga 338856 



RESULT 6 
US-08-387-942C-37 

Sequence 37, Application US/08387942C 
Patent No. 5939289 
GENERAL INFORMATION: 

APPLICANT: ERTESVAG, HELGA 
APPLICANT: VALLA, SVEIN 
APPLICANT: SKJAK-BRAEK, GUDMUND 
APPLICANT: LARSEN, BJORN 

TITLE OF INVENTION: DNA COMPOUNDS COMPRISING SEQUENCES 
TITLE OF INVENTION: ENCODING MANNURONAN C-5-EPIMERASE 
NUMBER OF SEQUENCES: 52 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: BIRCH, STEWART, KOLASCH & BIRCH, LLP 
STREET: P.O.BOX 747 
CITY: FALLS CHURCH 
STATE: VA 
COUNTRY: USA 
ZIP: 22042 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/08/387,942C
FILING DATE: 09-MAY-1995 
CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 
NAME: MURPHY JR, GERALD M. 
REGISTRATION NUMBER: 28,977 
REFERENCE/DOCKET NUMBER: 1809-106P 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 703-205-8000 
TELEFAX: 703-205-8050 
INFORMATION FOR SEQ ID NO: 37: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 459 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: DNA (genomic) 
US-08-387-942C-37 



Query Match 8.2%; Score 34.6; DB 2; Length 459; 

Best Local Similarity 48.5%; Pred. No. 0.56; 

Matches 94; Conservative 0; Mismatches 100; Indels 0; Gaps 0; 

Qy 171 ttgccggacccagcgagggggtggcggcgatctccaaaggggggaaggtanggccgaacc 230 

I I I I I I I I I I I I I III III I I I I I I I I I I I I 
Db 29 TTTCCGGCAGCAGCGCGGCCGAGGAACTGCTCGGCGGGGTCGGCAACGACAGCCTGGACG 88



Qy 231 ggtgcagcgcctgcaggaagagggttggacttacgggattcaactgccggtgtgggaact 290 

I I I I I I I I III I I I I I I I I I I I I I I I 

Db 89 GCGGCGCCGGCAACGACATCCTCGACGGCGGGGCGGGGCGCGACACCCTGAGTGGCGGCA 148

Qy 291 tgtactgcgcactccaccgctactccgacaagcacgactgcaagttcgactaacggactg 350

I I I I II I I I I I I III I I I I I I I I I I I I I I I

Db 149 GCGGCAGCGACATCTTCCGCTTCGGCGGCGCGCTCGACAGCTTCCGCAACTACGCCAGCG 208

Qy 351 ctgccaaggacgcc 364

I I I I I I I
Db 209 GGACGAACGGCACC 222



RESULT 7 
US-08-387-942C-1 

Sequence 1, Application US/08387942C 
Patent No. 5939289 
GENERAL INFORMATION: 

APPLICANT: ERTESVAG, HELGA 
APPLICANT: VALLA, SVEIN 
APPLICANT: SKJAK-BRAEK, GUDMUND 
APPLICANT: LARSEN, BJORN 

TITLE OF INVENTION: DNA COMPOUNDS COMPRISING SEQUENCES 
TITLE OF INVENTION: ENCODING MANNURONAN C-5-EPIMERASE
NUMBER OF SEQUENCES: 52 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: BIRCH, STEWART, KOLASCH & BIRCH, LLP 
STREET: P.O.BOX 747 
CITY: FALLS CHURCH 
STATE: VA 
COUNTRY: USA 
ZIP: 22042 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/08/387,942C
FILING DATE: 09-MAY-1995 
CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 
NAME: MURPHY JR, GERALD M. 
REGISTRATION NUMBER: 28,977 
REFERENCE/DOCKET NUMBER: 1809-106P 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 703-205-8000 
TELEFAX: 703-205-8050
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 12588 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: DNA (genomic) 
ORIGINAL SOURCE: 

ORGANISM: Azotobacter vinelandii 



STRAIN: E
FEATURE:

NAME/KEY: CDS
LOCATION: 290..1951
FEATURE:

NAME/KEY: CDS
LOCATION: 2227..6438
FEATURE:

NAME/KEY: CDS
LOCATION: 6702..9695
FEATURE:

NAME/KEY: CDS
LOCATION: 9973..12588
US-08-387-942C-1 



Query Match 8.2%; Score 34.6; DB 2; Length 12588; 

Best Local Similarity 48.5%; Pred. No. 1.8; 

Matches 94; Conservative 0; Mismatches 100; Indels 0; Gaps 0;

Qy 171 ttgccggacccagcgagggggtggcggcgatctccaaaggggggaaggtanggccgaacc 230
I I I I I I I I I I I I I 111 III I I Mill I I I I I
Db 9205

Qy 231
I II II I I I III I I I I I II II I MM
Db 9265

Qy 291
II II I I I I I I III II II II I I I I I I I I I
Db 9325

Qy 351
I 1 1 I I 1 1
Db 9385



RESULT 8
US-09-167-717-3 

; Sequence 3, Application US/09167717 

; Patent No. 6133014 

; GENERAL INFORMATION: 

; APPLICANT: MUKOUYAMA, MASAHARU 

; APPLICANT: YASUDA, SHINZO 

; APPLICANT: KOMATSUZAKI, SATOMI

TITLE OF INVENTION: MALEATE ISOMERASE GENE 
FILE REFERENCE: PH-555US 
; CURRENT APPLICATION NUMBER: US/09/167,717 
; CURRENT FILING DATE: 1998-10-07 
; EARLIER APPLICATION NUMBER: JP 276261/1997 
; EARLIER FILING DATE: 1997-10-08 
; NUMBER OF SEQ ID NOS: 14
; SEQ ID NO 3

LENGTH: 750

TYPE: DNA 

ORGANISM: Pseudomonas fluorescens 
FEATURE: 



NAME/KEY: CDS 
LOCATION: (1)..(750)
US-09-167-717-3 



Query Match 8.2%; Score 34.4; DB 3; Length 750; 

Best Local Similarity 48.9%; Pred. No. 0.76; 

Matches 92; Conservative 0; Mismatches 96; Indels 0; Gaps 0; 

Qy 14 gtccggcaggatcaggccaagctggctgcctcctctatcgacagcatcgtgaacggcagc 73

I I I I I I I I I I I I I I I M I I I I I I I I III I I I I I

Db 511 gcccgtcacgacacggccaacctgccgggcatcgtcgccggcatgaaccttgagggcgtc 570

Qy 74 gacgccgtcatggagccggttgttgctggcagcaacacggtagtagctgttgcccaagtt 133

II I II II II I I I II I - II II Ml I II 

Db 571 gatgtggtggtgctttcagcctgcgtgcagatgcagtcgctgccggcagtcgccaaggtc 630 

Qy 134 gagttgcaaacaatgaacgtgcagcagcccgctgatgttgccggacccagcgagggggtg 193 

Ml MINI II I I I I I I I I I I I I II II II 
Db 631 gaggcgcaaaccggcaaaccggtggtcaccgctgccatcgccaccacctacgccatgctc 690 

Qy 194 gcggcgat 201 
II I I I 

Db 691 aaggcgct 698 



RESULT 9 
US-09-036-987A-1 

Sequence 1, Application US/09036987A 
Patent No. 6143526 
GENERAL INFORMATION: 

APPLICANT: Baltz, Richard H. 
APPLICANT: Broughton, Mary C. 
APPLICANT: Crawford, Kathryn P. 
APPLICANT: Madduri, Krishnamurthy 
APPLICANT: Merlo, Donald J. 
APPLICANT: Treadway, Patti J. 
APPLICANT: Turner, Jan R. 
APPLICANT: Waldron, Clive 

TITLE OF INVENTION: Biosynthetic Genes For Spinosyn Insecticide 
TITLE OF INVENTION: Production 
NUMBER OF SEQUENCES: 39 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Dow AgroSciences LLC Patent Department 
STREET: 9330 Zionsville Road 
CITY: Indianapolis 
STATE: Indiana 
COUNTRY: USA 
ZIP: 46268 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/09/036,987A
FILING DATE: 09-MAR-1998 



CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION:

NAME: Stuart, Donald R 

REGISTRATION NUMBER: 28,479 

REFERENCE/DOCKET NUMBER: 50,608 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (317)337-4816

TELEFAX: (317)337-4847 
; INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 80161 base pairs 

TYPE: nucleic acid 

STRANDEDNESS: double 

TOPOLOGY: linear 
MOLECULE TYPE: DNA (genomic) 
US-09-036-987A-1 



Query Match 7.9%; Score 33.2; DB 3; Length 80161; 

Best Local Similarity 55.1%; Pred. No. 8.8; 

Matches 65; Conservative 0; Mismatches 53; Indels 0; Gaps 0; 

Qy 23 gatcaggccaagctggctgcctcctctatcgacagcatcgtgaacggcagcgacgccgtc 82 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I II I 
Db 21852 GATCAGGCGCACGCCGATGGCGACCGGATCTACTGCCTCATCCGCGGCAGCGCGGTCAAC 21911 

Qy 83 atggagccggttgttgctggcagcaacacggtagtagctgttgcccaagttgagttgc 140 

1 II II I I I I I I III I III I I I I I I I I I I I I 

Db 21912 AACGATGGGGGCGGTGCCGGGCTCACCGTTCCGGCGGCGGACGCCCAGGCGGAGCTGC 21969 



RESULT 10 
US-09-370-700-1 

; Sequence 1, Application US/09370700 

; Patent No. 6274350 

; GENERAL INFORMATION: 

; APPLICANT: Baltz, Richard H 

; APPLICANT: Broughton, Mary C 

; APPLICANT: Crawford, Kathryn P 

; APPLICANT: Madduri, Krishnamurthy 

APPLICANT: Treadway, Patti J 

APPLICANT: Turner, Jan R 
; APPLICANT: Waldron, Clive 

TITLE OF INVENTION: Biosynthetic Genes For Spinosyn Insecticide 
; FILE REFERENCE: 50489 DIV1 

; CURRENT APPLICATION NUMBER: US/09/370,700 

; CURRENT FILING DATE: 1999-08-09 

; EARLIER APPLICATION NUMBER: US 09/36987 

; EARLIER FILING DATE: 1998-03-09 

; NUMBER OF SEQ ID NOS: 39

SOFTWARE: PatentIn Ver. 2.0
; SEQ ID NO 1 

LENGTH: 80161 

TYPE: DNA 

; ORGANISM: Saccharopolyspora spinosa 
US-09-370-700-1 



Query Match 7.9%; Score 33.2; DB 4; Length 80161; 

Best Local Similarity 55.1%; Pred. No. 8.8; 

Matches 65; Conservative 0; Mismatches 53; Indels 0; Gaps 0;



Qy 23 gatcaggccaagctggctgcctcctctatcgacagcatcgtgaacggcagcgacgccgtc 82 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 21852 gatcaggcgcacgccgatggcgaccggatctactgcctcatccgcggcagcgcggtcaac 21911 

Qy 83 atggagccggttgttgctggcagcaacacggtagtagctgttgcccaagttgagttgc 140 

I II II I I I I I I II I I III I I I I I I I I I I I I 

Db 21912 aacgatgggggcggtgccgggctcaccgttccggcggcggacgcccaggcggagctgc 21969 



RESULT 11 
US-09-103-840A-2 

; Sequence 2, Application US/09103840A 

; Patent No. 6294328 

; GENERAL INFORMATION: 

; APPLICANT: FLEISCHMAN, Robert D. 

; APPLICANT: WHITE, Owen R. 

; APPLICANT: FRASER, Claire M. 

; APPLICANT: VENTER, John C. 

; TITLE OF INVENTION: DNA SEQUENCES FOR STRAIN ANALYSIS IN MYCOBACTERIUM 
TITLE OF INVENTION: TUBERCULOSIS 
FILE REFERENCE: 24366-20007.00
; CURRENT APPLICATION NUMBER: US/09/103,840A
; CURRENT FILING DATE: 1998-06-24
; NUMBER OF SEQ ID NOS: 2
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 2 

LENGTH: 4403765 

TYPE: DNA 

; ORGANISM: Mycobacterium tuberculosis 
FEATURE: 

OTHER INFORMATION: CDC 1551 

OTHER INFORMATION: "n" bases at various positions throughout the sequence 
OTHER INFORMATION: represent a, t, c or g 
US-09-103-840A-2 



Query Match 7.9%; Score 33.2; DB 4; Length 4403765; 

Best Local Similarity 47.0%; Pred. No. 26; 

Matches 102; Conservative 0; Mismatches 115; Indels 0; Gaps 0; 

Qy 179 cccagcgagggggtggcggcgatctccaaaggggggaaggtanggccgaaccggtgcagc 238 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 

Db 1383125 ccccgcgagcggttctcggcgctcacccggctggaccacaatcgggcgatctcgcagctg 1383184 

Qy 239 gcctgcaggaagagggttggacttacgggattcaactgccggtgtgggaacttgtactgc 298 

I I I I I I I III I I I I I I I I I I I 

Db 1383185 gccgccaagaccggcgcggcggtcaccgacatcaagaagatgacgatctggggcaatcac 1383244 

Qy 299 gcactccaccgctactccgacaagcacgactgcaagttcgactaacggactgctgccaag 358

I I I MINIM I III I I I I I I I II II I I I I 
Db 1383245 tcggccacccagtaccccgacctgttccacgcggaggtcgccggaaagaacgcggccgaa 1383304 



Qy 359 gacgccattgccaagggctaatccggtggtgaaggca 395 

I III I I III I I II I I I I I II 
Db 1383305 gtggtcaacgaccaggcctggatcgaggatgaattca 1383341 



RESULT 12 
US-08-981-392-24 

Sequence 24, Application US/08981392 
Patent No. 6262025 
GENERAL INFORMATION:

APPLICANT: Ish-Horowicz, David 
APPLICANT: Henrique, Domingos Manuel Pinto 
APPLICANT: Lewis, Julian Hart 
APPLICANT: Artavanis-Tsakonas, Spyridon
APPLICANT: Gray, Grace 

TITLE OF INVENTION: NUCLEOTIDE AND PROTEIN SEQUENCES 

TITLE OF INVENTION: OF VERTEBRATE DELTA GENES AND METHODS BASED THEREON 
NUMBER OF SEQUENCES: 94 
CORRESPONDENCE ADDRESS: 
ADDRESSEE: Pennie & Edmonds LLP
STREET: 1155 Avenue of the Americas 
CITY: New York 
STATE: NY 
COUNTRY: USA 
ZIP: 10036/2711 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/981,392
FILING DATE: 22-DEC-1997 
CLASSIFICATION: 514 
ATTORNEY/AGENT INFORMATION: 
NAME: Antler, Adriane M. 
REGISTRATION NUMBER: 32,605 
REFERENCE/DOCKET NUMBER: 7326-038 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 212-790-9090
TELEFAX: 212-869-8864 
TELEX: 66141 PENNIE 
INFORMATION FOR SEQ ID NO: 24: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 2899 base pairs 
TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: DNA 
US-08-981-392-24 



Query Match 7.8%; Score 33; DB 4; Length 2899; 

Best Local Similarity 35.3%; Pred. No. 3; 

Matches 78; Conservative 31; Mismatches 112; Indels 0; Gaps 0; 



Qy 147 tgaacgtgcagcagcccgctgatgttgccggacccagcgagggggtggcggcgatctcca 206 



III. 1*111 1*11 • -1*1 -III I * 111*1 I 1*1 • I I I 

Db 1750 TGARGCTRCAGAARCACCRGCCYCCASCYGAMCCCTGNSGGGGRGAGACRGARACCATGA 1809

Qy 207 aaggggggaaggtanggccgaaccggtgcagcgcctgcaggaagagggttggacttacgg 266

I : I I I : I I : : I II I : I I : I I : 

Db 1810 ACAACCTRGNCAAYTGCCAGCGYGAGAAGGACRTYTCWGTYAGCATCATYGGGGNYACSC 1869

Qy 267 gattcaactgccggtgtgggaacttgtactgcgcactccaccgctactccgacaagcacg 326

MM I III I III: III: M : I : M I : : : 

Db 1870 ACATCAAGAACACCAACAAGAAGGCGGACTTYCACGGGGACCAYRGNGCCRASAAGARYR 1929

Qy 327 actgcaagttcgactaacggactgctgccaaggacgccatt 367 

II : I II : I : II I : : I III II I I : 
Db 1930 GCTTYAAGGYCCGMTACCCMRNKGTGGACTATAACCTCGTK 1970 



RESULT 13 
US-08-470-058-3 

; Sequence 3, Application US/08470058 
; Patent No. 5817789 
; GENERAL INFORMATION: 

APPLICANT: Heartlein, Michael W. 

APPLICANT: Lemontt, Jeffrey F. 

TITLE OF INVENTION: Chimeric Proteins For Use in Transport 
TITLE OF INVENTION: of a Selected Substance Into Cells 
NUMBER OF SEQUENCES: 12 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Hamilton, Brook, Smith & Reynolds, P.C. 
; STREET: Two Militia Drive 

CITY: Lexington 

STATE: MA

COUNTRY: USA 

ZIP: 02173
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS /MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/470,058 

FILING DATE: 06-JUN-1995 

CLASSIFICATION: 530 
ATTORNEY/AGENT INFORMATION: 
; NAME: Granahan, Patricia 

REGISTRATION NUMBER: 32,227 

REFERENCE/DOCKET NUMBER: TKT93-01 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 617-861-6240 

TELEFAX: 617-861-9540 
; INFORMATION FOR SEQ ID NO: 3: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 4601 base pairs 
; TYPE: nucleic acid 

STRANDEDNESS: both 

TOPOLOGY: both 
MOLECULE TYPE: cDNA to mRNA 
US-08-470-058-3 



Query Match 7.7%; Score 32.4; DB 1; Length 4601; 

Best Local Similarity 58.2%; Pred. No. 5.3; 

Matches 57; Conservative 0; Mismatches 41; Indels 0; Gaps 0; 



Qy         20 caggatcaggccaagctggctgcctcctctatcgacagcatcgtgaacggcagcgacgcc 79
              ||||  |  | |  ||||||||    ||  ||| |||||| | |  || | | |||| |
Db       1433 CAGGCCCCCGACGGGCTGGCTGTGGACTGGATCCACAGCAACATCTACTGGACCGACTCT 1492

Qy         80 gtcatggagccggttgttgctggcagcaacacggtagt 117
              ||| |||   | ||   || ||     | || ||  ||
Db       1493 GTCCTGGGCACTGTCTCTGTTGCGGATACCAAGGGCGT 1530



RESULT 14 
US-09-037-188-3 

; Sequence 3, Application US/09037188 

; Patent No. 6027921 

; GENERAL INFORMATION: 

APPLICANT: Heartlein, Michael W. 

APPLICANT: Lemontt, Jeffrey F. 

APPLICANT: Concino, Michael F. 

TITLE OF INVENTION: CHIMERIC PROTEINS FOR USE 

TITLE OF INVENTION: IN TRANSPORT OF A SELECTED SUBSTANCE INTO CELLS 
NUMBER OF SEQUENCES: 12 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Fish & Richardson P.C. 

STREET: 225 Franklin Street 
; CITY: Boston 

STATE: MA 

COUNTRY: USA 

ZIP: 02110-2804 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: Windows95 
; SOFTWARE: FastSEQ for Windows Version 2.0 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/037,188 

FILING DATE: 02-MAR-1998 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 

FILING DATE: 06-JUN-1995 
ATTORNEY/AGENT INFORMATION: 

NAME: Fraser, Ph.D., J.D., Janis K. 

REGISTRATION NUMBER: 34,819 

REFERENCE/DOCKET NUMBER: 07236/009002 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 617/542-5070 

TELEFAX: 617/542-8906 

TELEX: 200154 
; INFORMATION FOR SEQ ID NO: 3: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 4601 base pairs 

TYPE: nucleic acid 

STRANDEDNESS: both 



TOPOLOGY: both 
MOLECULE TYPE: cDNA to mRNA 
US-09-037-188-3 



Query Match 7.7%; Score 32.4; DB 3; Length 4601;

Best Local Similarity 58.2%; Pred. No. 5.3; 

Matches 57; Conservative 0; Mismatches 41; Indels 0; Gaps 0; 

Qy         20 caggatcaggccaagctggctgcctcctctatcgacagcatcgtgaacggcagcgacgcc 79
              ||||  |  | |  ||||||||    ||  ||| |||||| | |  || | | |||| |
Db       1433 CAGGCCCCCGACGGGCTGGCTGTGGACTGGATCCACAGCAACATCTACTGGACCGACTCT 1492

Qy         80 gtcatggagccggttgttgctggcagcaacacggtagt 117
              ||| |||   | ||   || ||     | || ||  ||
Db       1493 GTCCTGGGCACTGTCTCTGTTGCGGATACCAAGGGCGT 1530



RESULT 15 
US-09-285-310-3 

; Sequence 3, Application US/09285310 
; Patent No. 6262026 
; GENERAL INFORMATION: 

APPLICANT: Heartlein, Michael W. 

APPLICANT: Lemontt, Jeffrey F. 
; APPLICANT: Concino, Michael F. 

TITLE OF INVENTION: CHIMERIC PROTEINS FOR USE 

TITLE OF INVENTION: IN TRANSPORT OF A SELECTED SUBSTANCE INTO CELLS 
NUMBER OF SEQUENCES: 12 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Fish & Richardson P.C. 

STREET: 225 Franklin Street 
; CITY: Boston 

STATE: MA 

COUNTRY: USA 

ZIP: 02110-2804
COMPUTER READABLE FORM: 
; MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: Windows95 
; SOFTWARE: FastSEQ for Windows Version 2.0 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/285,310 

FILING DATE: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 09/037,188 

FILING DATE: 
ATTORNEY/AGENT INFORMATION: 
; NAME: Fraser, Ph.D., J.D., Janis K. 

REGISTRATION NUMBER: 34,819 

REFERENCE/DOCKET NUMBER: 07236/009002 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 617/542-5070 

TELEFAX: 617/542-8906 

TELEX: 200154 
; INFORMATION FOR SEQ ID NO: 3: 
SEQUENCE CHARACTERISTICS: 



LENGTH: 4601 base pairs 
TYPE: nucleic acid 
STRANDEDNESS: both
TOPOLOGY: both 
MOLECULE TYPE: cDNA to mRNA 



Query Match 7.7%; Score 32.4; DB 4; Length 4601; 

Best Local Similarity 58.2%; Pred. No. 5.3; 

Matches 57; Conservative 0; Mismatches 41; Indels 0; Gaps 0; 

Qy         20 caggatcaggccaagctggctgcctcctctatcgacagcatcgtgaacggcagcgacgcc 79
              ||||  |  | |  ||||||||    ||  ||| |||||| | |  || | | |||| |
Db       1433 CAGGCCCCCGACGGGCTGGCTGTGGACTGGATCCACAGCAACATCTACTGGACCGACTCT 1492

Qy         80 gtcatggagccggttgttgctggcagcaacacggtagt 117
              ||| |||   | ||   || ||     | || ||  ||
Db       1493 GTCCTGGGCACTGTCTCTGTTGCGGATACCAAGGGCGT 1530



Search completed: February 7, 2002, 11:12:10 
Job time: 7296 sec 

GenCore version 4.5 

Copyright (c) 1993 - 2000 Compugen Ltd. 

OM nucleic - nucleic search, using sw model 

Run on: February 7, 2002, 08:20:48 ; Search time 4942.22 Seconds 

(without alignments) 
915.373 Million cell updates/sec 

Title: US-09-394-745-6489 
Perfect score: 421 

Sequence: 1 ggggccagcacgcgtccggc ctcgacaagatctaaggggg 421 

Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 11351937 seqs, 5372889281 residues 

Total number of hits satisfying chosen parameters: 22703874 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : EST:*
1: em_estfun:*
2: em_esthum:*
3: em_estin:*
4: em_estom:*
5: em_estpl:*
6: em_estba:*
7: em_estro:*
8: em_estov:*
9: em_htc:*
10: gb_est1:*
11: gb_est2:*
12: gb_htc:*
13: gb_gss:*
14: em_gss_fun:*
15: em_gss_hum:*
16: em_gss_inv:*
17: em_gss_pln:*
18: em_gss_pro:*
19: em_gss_rod:*
20: em_gss_vrt:*
21: em_gss_other:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
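
The report does not say how its percentage columns are computed. The figures it prints are, however, consistent with two simple ratios: "Query Match" equals the hit score divided by the perfect score of 421 (385.2 gives 91.5%), and "Best Local Similarity" equals Matches divided by the total number of aligned columns (398 of 403 gives 98.8%). The short Python sketch below illustrates that reading. The function names are hypothetical, and the formulas are an inference from the numbers in this report, not documented GenCore behaviour.

```python
def query_match_pct(score: float, perfect_score: float) -> float:
    """Hit score as a percentage of the query's perfect (self-match) score.

    Assumption: inferred from the values printed in this report.
    """
    return round(100.0 * score / perfect_score, 1)


def best_local_similarity_pct(matches: int, conservative: int,
                              mismatches: int, indels: int) -> float:
    """Identical positions as a percentage of all aligned columns.

    Assumption: the aligned columns are the sum of the four counts
    printed on the "Matches; Conservative; Mismatches; Indels" line.
    """
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


if __name__ == "__main__":
    # Values taken from the first EST hit (BG841717) in this report.
    print(query_match_pct(385.2, 421))
    print(best_local_similarity_pct(398, 0, 4, 1))
```

Checking the printed percentages against these ratios is also a quick way to spot digits that were damaged when the report was scanned.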

SUMMARIES 

                  %
Result          Query
   No.   Score  Match  Length  DB  ID          Description
c    1   385.2   91.5     635  11  BG841717    BG841717 MEST24-E1
     2   378.8   90.0     528  10  BE012241    BE012241 945015H01
c    3   378.8   90.0     629  10  AW506899    AW506899 660060H09
     4   367.4   87.3     602  10  AW928018    AW928018 945006F08
     5   362.2   86.0     477  10  BE640589    BE640589 945020B04
     6   351.8   83.6     518  10  BE129631    BE129631 945027C01
     7   348.8   82.9     594  10  AI881804    AI881804 606073E02
     8   331.4   78.7     488  11  BG355155    BG355155 947043D10
c    9   324.8   77.1     482  10  BE510926    BE510926 946056E05
    10   312.8   74.3     567  10  AW927909    AW927909 945010H04
c   11   290.6   69.0     643  10  AI977903    AI977903 496035D05
c   12     290   68.9     514  10  AW061851    AW061851 687007F12
    13   287.4   68.3     527  10  AI649645    AI649645 486077B07
c   14   276.8   65.7     599  10  BE129734    BE129734 946003G07
    15   264.2   62.8     660  11  W99253      W99253 EST01 Gluco
    16   263.8   62.7     598  10  AW924551    AW924551 WS1 70 F0
    17   260.6   61.9     582  10  BE361054    BE361054 DG1 69 E0
    18   259.6   61.7     595  11  BI246082    BI246082 IP1 68 El
    19   259.4   61.6     479  11  BE918267    BE918267 OV1 1 C02
    20     258   61.3     549  10  AW231809    AW231809 687042A08
    21   256.6   61.0     547  11  BE918046    BE918046 OV1 1 C02
    22   255.4   60.7     586  10  BE363253    BE363253 WS1 61 F0
c   23   254.4   60.4     608  10  BE055927    BE055927 945020B03
c   24   254.2   60.4     612  10  AI734759    AI734759 606036A08
    25   253.4   60.2     662  11  BG932962    BG932962 DG1 69 E0
c   26   247.2   58.7     594  10  AI622011    AI622011 486031B05
c   27     245   58.2     651  10  AI861117    AI861117 603012D01
    28     244   58.0     555  10  AW680999    AW680999 WS1 9 D08
    29   243.6   57.9     662  11  BG464005    BG464005 EMI 52_E0


30 


9/1 c 
241.0 


D / . 


. 4 


a a q 
64 y 


1 1 
1 1 




nb4 Dj<£ o 1 


TTM1 1A PI 


c 


31 


O A 1 A 

241.4 


C 1 


, 3 


C Q 9 


1U 


ai o y i o / y 


7\T£Cn Q7Q 
Al D y 1 o / y 


c n i in i 


c 


O O 

32 


23o 


C £1 

OD . 


. 5 


•JOT 

32 / 


10 


7\ r.i A no A CO 

AW 4 J o 4 0 Z 


a r»7 a on A CO 
AW 4 J J 4 0 Z 


/u /uoynuo 




"3 o 

33 


O 1 A O 

234.o 


c c 
DO , 


o 
, 0 


A Ci 1 

4 y i 


1U 


7\ Tat q o a ccc; 
aw y Z 4 0 0 0 


n tvt q o /i c £ c 

aw yz 4 DOj 


Wol / U tU 




o yi 


232.4 


c c 
30 . 


o 

, 2 


/ 9 9 
4 / 3 


1U 


bbUOl / / 1 


DCjU 0 1 / / 1 


zao / nu o . g 


c 


o c 

35 


230.6 


C j1 

54 . 


. 8 


4 6/ 


10 


tv TO C1 1 CO 

AIo ol 1 o2 


ATQC1 1 C 9 

Al obll oZ 


DU jUI / dUo 




36 


O O 1 

221 


c o 

52 , 


r 

, D 


231 


1 U 


BE64 UOO 0 


DLtD 4 Ujoo 


y 4 duzudUj 




37 


219 


52 . 


, 0 


coo 

53o 


1 1 


BG3i 3o yy 


npO 1 9 Q Q Q 


WnhZUOO U 




o o 

38 


O 1 T 

.21/ 


c t 
51 . 


, 5 


c o c 

o2 b 


1 1 


Bb JO OZ (JO 


J D DZ U 0 


DMi 99 pn 
LLY11 ZZ uU 




39 


216.2 


C 1" 

51 . 


, 4 


62U 


1 1 


Bb 4 / JU / 4 


DC 4 / JU / 4 


w u t? n o o n 
WniljUyZO u 




    40   216.2   51.4     693  10  BE585592    BE585592 EST#5PSP6
    41     215   51.1     571  10  BE420025    BE420025 WWS02.C8R
    42   213.6   50.7     260  10  BE056027    BE056027 945017E04
c   43   211.6   50.3     830  10  BE705552    BE705552 Sc01_03a0
    44     211   50.1     664  10  BE431005    BE431005 SUN010.C0
c   45   210.8   50.1     715  10  BE705540    BE705540 Sc01_02h0



ALIGNMENTS 



RESULT 1 
BG841717/C 
LOCUS       BG841717      635 bp    mRNA            EST       29-MAY-2001
DEFINITION  MEST24-E10.T3 ISUM4-TN Zea mays cDNA clone MEST24-E10 3', mRNA
            sequence.
ACCESSION   BG841717
VERSION     BG841717.2  GI:14244031
KEYWORDS    EST.
SOURCE      Zea mays.
  ORGANISM  Zea mays
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; PACC
            clade; Panicoideae; Andropogoneae; Zea.
REFERENCE   1  (bases 1 to 635)
  AUTHORS   Qiu,F., Cui,F., Guo,L., Ashlock,D.A, Wen,T.J. and Schnable,P.S.
  TITLE     Expressed Sequence Tags from B73 Maize Seedlings and Silks
  JOURNAL   Unpublished (2001)
COMMENT     On May 25, 2001 this sequence version replaced gi:14208039.
            Contact: Patrick S. Schnable
            Schnable Laboratory
            Iowa State University
            G405 Agronomy, Iowa State University, Ames, IA 50011-1010, USA
            Tel: 515-294-0975
            Fax: 515-294-2299
            Email: schnable@iastate.edu
            PCR PRimers
            FORWARD: T7-1 (AA TAC GAC TCA CTA TAG)
            BACKWARD: T3 (ATT AAC CCT CAC TAA AG)
            Seq primer: primer T3 (ATT AAC CCT CAC TAA AG).
FEATURES             Location/Qualifiers
     source          1..635
                     /organism="Zea mays"
                     /cultivar="B73"
                     /db_xref="taxon:4577"
                     /clone="MEST24-E10"
                     /clone_lib="ISUM4-TN"
                     /tissue_type="Seedling and silk"
                     /lab_host="DH10B"
                     /note="Vector: pT7T3PAC; Site_1: EcoRI; Site_2: NotI;
                     ds-cDNA molecules were generated as follows. First-strand
                     cDNA was prepared from oligo-dT selected mRNA by priming
                     with a NotI oligo-dT primer (5'
                     AACTGGAAGAATTCGCGGCCGCAGGAATTTTTTTTTTTTTTTTTT). The
                     resulting DNA:RNA hybrid was treated with RNase H and used
                     as a template for DNA PolI-catalyzed second strand
                     synthesis. After the addition of EcoRI adaptors, the
                     ds-cDNAs were digested with NotI and size-selected. The
                     resulting molecules were directionally cloned into the
                     EcoRI and NotI sites of the pT7T3PAC vector. The library
                     then went through one round of normalization to CoT value
                     of 5 based on the methods of Marcelo Bento Soares (Genome
                     Research 6: 791-806, 1996)."
BASE COUNT      132 a    190 c    166 g    147 t
ORIGIN



Query Match 91.5%; Score 385.2; DB 11; Length 635;
Best Local Similarity 98.8%; Pred. No. 4e-82;
Matches 398; Conservative 0; Mismatches 4; Indels 1; Gaps 1;

Qy         19 gcaggatcaggccaagctggctgcctcctctatcgacagcatcgtgaacggcagcgacgc 78
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        630 GCAGGATCAGGCCAAGCTGGCTGCCTCCTCTATCGACAGCATCGTGAACGGCAGCGACGC 571

Qy         79 cgtcatggagccggttgttgctggcagcaacacggtagtagctgttgcccaagttgagtt 138
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        570 CGTCATGGAGCCGGTTGTTGCTGGCAGCAACACGGTAGTAGCTGTTGCCCAAGTTGAGTT 511

Qy        139 gcaaacaatgaacgtgcagcagcccgctgatgttgccggacccagcgagggggtggcggc 198
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        510 GCAAACAATGAACGTGCAGCAGCCCGCTGATGTTGCCGGACCCAGCGAGGGGGTGGCGGC 451

Qy        199 gatctccaaaggggggaaggtanggccgaaccggtgcagcgcctgcaggaagagggttgg 258
              |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db        450 GATCTCCAAAGGGGGGAAGGTAGGGCCGAACCGGTGCAGCGCCTGCAGGAAGAGGGTTGG 391

Qy        259 acttacgggattcaactgccggtgtgggaacttgtactgcgcactccaccgctactccga 318
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        390 ACTTACGGGATTCAACTGCCGGTGTGGGAACTTGTACTGCGCACTCCACCGCTACTCCGA 331

Qy        319 caagcacgactgcaagttcgactaacggactgctgccaaggacgccattgccaagggcta 378
              |||||||||||||||||||||||| ||||||||||||| ||||||||||||||| |||||
Db        330 CAAGCACGACTGCAAGTTCGACTACCGGACTGCTGCCAGGGACGCCATTGCCAA-GGCTA 272

Qy        379 atccggtggtgaaggcagacaagctcgacaagatctaaggggg 421
              ||||||||||||||||||||||||||||||||||||| |||||
Db        271 ATCCGGTGGTGAAGGCAGACAAGCTCGACAAGATCTAGGGGGG 229





RESULT 2 
BE012241 

LOCUS       BE012241      528 bp    mRNA            EST       06-JUN-2000
DEFINITION  945015H01.Y1 945 - Mixed adult tissues from Walbot lab, same as 707
            (SK) Zea mays cDNA, mRNA sequence.
ACCESSION   BE012241
VERSION     BE012241.1  GI:8273149
KEYWORDS    EST.
SOURCE      Zea mays.
  ORGANISM  Zea mays
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; PACC
            clade; Panicoideae; Andropogoneae; Zea.
REFERENCE   1  (bases 1 to 528)
  AUTHORS   Walbot,V.
  TITLE     Maize ESTs from various cDNA libraries sequenced at Stanford
            University
  JOURNAL   Unpublished (1999)
COMMENT     Contact: Walbot V
            Department of Biological Sciences
            Stanford University
            855 California Ave, Palo Alto, CA 94304, USA
            Tel: 650 723 2227
            Fax: 650 725 8221
            Email: walbot@stanford.edu
            Plate: 945015 row: H column: 01.
FEATURES             Location/Qualifiers
     source          1..528
                     /organism="Zea mays"
                     /cultivar="W23"
                     /db_xref="taxon:4577"
                     /clone_lib="945 - Mixed adult tissues from Walbot lab,
                     same as 707 (SK)"
                     /tissue_type="tassel, kernal, silk, husk, root, leaf"
                     /dev_stage="fully-grown"
                     /lab_host="DH10B"
                     /note="Organ: tassel, kernal, silk, husk, root, leaf;
                     Vector: pGAD10; Site_1: EcoRI; cDNA library from fully
                     differentiated maize tissues from an active Mutator plant.
                     Tissue ratio is 4:2:1:1:1:1 (tassel, kernel, silk, husk,
                     root, leaf). Unidirectionally cloned. New library number
                     given to library 707 for additional sequencing."
BASE COUNT      120 a    146 c    176 g     86 t
ORIGIN



Query Match 90.0%; Score 378.8; DB 10; Length 528;
Best Local Similarity 97.8%; Pred. No. 1.3e-80;
Matches 394; Conservative 0; Mismatches 8; Indels 1; Gaps 1;

Qy         19 gcaggatcaggccaagctggctgcctcctctatcgacagcatcgtgaacggcagcgacgc 78
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        127 GCAGGATCAGGCCAAGCTGGCTGCCTCCTCTATCGACAGCATCGTGAACGGCAGCGACGC 186

Qy         79 cgtcatggagccggttgttgctggcagcaacacggtagtagctgttgcccaagttgagtt 138
              |||||||||||||||||||||||||||||||||||||||||||||||||||||| |||||
Db        187 CGTCATGGAGCCGGTTGTTGCTGGCAGCAACACGGTAGTAGCTGTTGCCCAAGTCGAGTT 246

Qy        139 gcaaacaatgaacgtgcagcagcccgctgatgttgccggacccagcgagggggtggcggc 198
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        247 GCAAACAATGAACGTGCAGCAGCCCGCTGATGTTGCCGGACCCAGCGAGGGGGTGGCGGC 306

Qy        199 gatctccaaaggggggaaggtanggccgaaccggtgcagcgcctgcaggaagagggttgg 258
              |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db        307 GATCTCCAAAGGGGGGAAGGTAGGGCCGAACCGGTGCAGCGCCTGCAGGAAGAGGGTTGG 366

Qy        259 acttacgggattcaactgccggtgtgggaacttgtactgcgcactccaccgctactccga 318
               || |||||||||||||||||||||||||||||||||||||| |||||||||||||||||
Db        367 GCTCACGGGATTCAACTGCCGGTGTGGGAACTTGTACTGCGCGCTCCACCGCTACTCCGA 426

Qy        319 caagcacgactgcaagttcgactaacggactgctgccaaggacgccattgccaagggcta 378
              |||||||||||||||||||||||| ||||||||||||| ||||||||||||||| |||||
Db        427 CAAGCACGACTGCAAGTTCGACTACCGGACTGCTGCCAGGGACGCCATTGCCAA-GGCTA 485

Qy        379 atccggtggtgaaggcagacaagctcgacaagatctaaggggg 421
              ||||||||||||||||||||||||||||||||||||| |||||
Db        486 ATCCGGTGGTGAAGGCAGACAAGCTCGACAAGATCTAGGGGGG 528



RESULT 3 
AW506899/C 
LOCUS       AW506899      629 bp    mRNA            EST       03-MAR-2000
DEFINITION  660060H09.x1 660 - Mixed stages of anther and pollen Zea mays cDNA,
            mRNA sequence.
ACCESSION   AW506899
VERSION     AW506899.1  GI:7145418
KEYWORDS    EST.
SOURCE      Zea mays.
  ORGANISM  Zea mays
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; PACC
            clade; Panicoideae; Andropogoneae; Zea.
REFERENCE   1  (bases 1 to 629)
  AUTHORS   Walbot,V.
  TITLE     Maize ESTs from various cDNA libraries sequenced at Stanford
            University
  JOURNAL   Unpublished (1999)
COMMENT     Contact: Walbot V
            Department of Biological Sciences
            Stanford University
            855 California Ave, Palo Alto, CA 94304, USA
            Tel: 650 723 2227
            Fax: 650 725 8221
            Email: walbot@stanford.edu
            Plate: 660060 row: H column: 09.
FEATURES             Location/Qualifiers
     source          1..629
                     /organism="Zea mays"
                     /cultivar="Ohio43"
                     /db_xref="taxon:4577"
                     /clone_lib="660 - Mixed stages of anther and pollen"
                     /tissue_type="whole premieotic anthers to pollen shed"
                     /dev_stage="premieotic anthers to pollen shed"
                     /lab_host="XLOLR"
                     /note="Organ: anthers; Vector: Lambda Zap; Site_1: EcoRI;
                     Site_2: XhoI; Anther and pollen cDNA library.
                     Directionally sequenced with 5' end at the EcoRI site.
                     Created by Amie Franklin."
BASE COUNT      137 a    193 c    171 g    128 t
ORIGIN



Query Match 90.0%; Score 378.8; DB 10; Length 629;
Best Local Similarity 97.8%; Pred. No. 1.4e-80;
Matches 394; Conservative 0; Mismatches 8; Indels 1; Gaps 1;

Qy         19 gcaggatcaggccaagctggctgcctcctctatcgacagcatcgtgaacggcagcgacgc 78
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        622 GCAGGATCAGGCCAAGCTGGCTGCCTCCTCTATCGACAGCATCGTGAACGGCAGCGACGC 563

Qy         79 cgtcatggagccggttgttgctggcagcaacacggtagtagctgttgcccaagttgagtt 138
              |||||||||||||||||||||||||||||||||||||||||||||||||||||| |||||
Db        562 CGTCATGGAGCCGGTTGTTGCTGGCAGCAACACGGTAGTAGCTGTTGCCCAAGTCGAGTT 503

Qy        139 gcaaacaatgaacgtgcagcagcccgctgatgttgccggacccagcgagggggtggcggc 198
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        502 GCAAACAATGAACGTGCAGCAGCCCGCTGATGTTGCCGGACCCAGCGAGGGGGTGGCGGC 443

Qy        199 gatctccaaaggggggaaggtanggccgaaccggtgcagcgcctgcaggaagagggttgg 258
              |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db        442 GATCTCCAAAGGGGGGAAGGTAGGGCCGAACCGGTGCAGCGCCTGCAGGAAGAGGGTTGG 383

Qy        259 acttacgggattcaactgccggtgtgggaacttgtactgcgcactccaccgctactccga 318
               || |||||||||||||||||||||||||||||||||||||| |||||||||||||||||
Db        382 GCTCACGGGATTCAACTGCCGGTGTGGGAACTTGTACTGCGCGCTCCACCGCTACTCCGA 323

Qy        319 caagcacgactgcaagttcgactaacggactgctgccaaggacgccattgccaagggcta 378
              |||||||||||||||||||||||| ||||||||||||| ||||||||||||||| |||||
Db        322 CAAGCACGACTGCAAGTTCGACTACCGGACTGCTGCCAGGGACGCCATTGCCAA-GGCTA 264

Qy        379 atccggtggtgaaggcagacaagctcgacaagatctaaggggg 421
              ||||||||||||||||||||||||||||||||||||| |||||
Db        263 ATCCGGTGGTGAAGGCAGACAAGCTCGACAAGATCTAGGGGGG 221





RESULT 4 

AW928018 

LOCUS       AW928018      602 bp    mRNA            EST       30-MAY-2000
DEFINITION  945006F08.Y1 945 - Mixed adult tissues from Walbot lab, same as 707
            (SK) Zea mays cDNA, mRNA sequence.
ACCESSION   AW928018
VERSION     AW928018.1  GI:8103377
KEYWORDS    EST.
SOURCE      Zea mays.
  ORGANISM  Zea mays
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; PACC
            clade; Panicoideae; Andropogoneae; Zea.
REFERENCE   1  (bases 1 to 602)
  AUTHORS   Walbot,V.
  TITLE     Maize ESTs from various cDNA libraries sequenced at Stanford
            University
  JOURNAL   Unpublished (1999)
COMMENT     Contact: Walbot V
            Department of Biological Sciences
            Stanford University
            855 California Ave, Palo Alto, CA 94304, USA
            Tel: 650 723 2227
            Fax: 650 725 8221
            Email: walbot@stanford.edu
            Plate: 945006 row: F column: 08.
FEATURES             Location/Qualifiers
     source          1..602
                     /organism="Zea mays"
                     /cultivar="W23"
                     /db_xref="taxon:4577"
                     /clone_lib="945 - Mixed adult tissues from Walbot lab,
                     same as 707 (SK)"
                     /tissue_type="tassel, kernal, silk, husk, root, leaf"
                     /dev_stage="fully-grown"
                     /lab_host="DH10B"
                     /note="Organ: tassel, kernal, silk, husk, root, leaf;
                     Vector: pGAD10; Site_1: EcoRI; cDNA library from fully
                     differentiated maize tissues from an active Mutator plant.
                     Tissue ratio is 4:2:1:1:1:1 (tassel, kernel, silk, husk,
                     root, leaf). Unidirectionally cloned. New library number
                     given to library 707 for additional sequencing."
BASE COUNT      134 a    176 c    191 g    101 t
ORIGIN


Query Match 87.3%; Score 367.4; DB 10; Length 602;
Best Local Similarity 97.9%; Pred. No. 7e-78;
Matches 382; Conservative 0; Mismatches 7; Indels 1; Gaps 1;

Qy         19 gcaggatcaggccaagctggctgcctcctctatcgacagcatcgtgaacggcagcgacgc 78
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        214 GCAGGATCAGGCCAAGCTGGCTGCCTCCTCTATCGACAGCATCGTGAACGGCAGCGACGC 273

Qy         79 cgtcatggagccggttgttgctggcagcaacacggtagtagctgttgcccaagttgagtt 138
              |||||||||||||||||||||||||||||||||||||||||||||||||||||| |||||
Db        274 CGTCATGGAGCCGGTTGTTGCTGGCAGCAACACGGTAGTAGCTGTTGCCCAAGTCGAGTT 333

Qy        139 gcaaacaatgaacgtgcagcagcccgctgatgttgccggacccagcgagggggtggcggc 198
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        334 GCAAACAATGAACGTGCAGCAGCCCGCTGATGTTGCCGGACCCAGCGAGGGGGTGGCGGC 393

Qy        199 gatctccaaaggggggaaggtanggccgaaccggtgcagcgcctgcaggaagagggttgg 258
              |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db        394 GATCTCCAAAGGGGGGAAGGTAGGGCCGAACCGGTGCAGCGCCTGCAGGAAGAGGGTTGG 453

Qy        259 acttacgggattcaactgccggtgtgggaacttgtactgcgcactccaccgctactccga 318
               || |||||||||||||||||||||||||||||||||||||| |||||||||||||||||
Db        454 GCTCACGGGATTCAACTGCCGGTGTGGGAACTTGTACTGCGCGCTCCACCGCTACTCCGA 513

Qy        319 caagcacgactgcaagttcgactaacggactgctgccaaggacgccattgccaagggcta 378
              |||||||||||||||||||||||| ||||||||||||| ||||||||||||||| |||||
Db        514 CAAGCACGACTGCAAGTTCGACTACCGGACTGCTGCCAGGGACGCCATTGCCAA-GGCTA 572

Qy        379 atccggtggtgaaggcagacaagctcgaca 408
              ||||||||||||||||||||||||||||||
Db        573 ATCCGGTGGTGAAGGCAGACAAGCTCGACA 602



RESULT 5 

BE640589 

LOCUS       BE640589      477 bp    mRNA            EST       30-AUG-2000
DEFINITION  945020B04.X1 945 - Mixed adult tissues from Walbot lab, same as 707
            (SK) Zea mays cDNA, mRNA sequence.
ACCESSION   BE640589
VERSION     BE640589.1  GI:9954021
KEYWORDS    EST.
SOURCE      Zea mays.
  ORGANISM  Zea mays
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; PACC
            clade; Panicoideae; Andropogoneae; Zea.
REFERENCE   1  (bases 1 to 477)
  AUTHORS   Walbot,V.
  TITLE     Maize ESTs from various cDNA libraries sequenced at Stanford
            University
  JOURNAL   Unpublished (1999)
COMMENT     Contact: Walbot V
            Department of Biological Sciences
            Stanford University
            855 California Ave, Palo Alto, CA 94304, USA
            Tel: 650 723 2227
            Fax: 650 725 8221
            Email: walbot@stanford.edu
            Plate: 945020 row: B column: 04.
FEATURES             Location/Qualifiers
     source          1..477
                     /organism="Zea mays"
                     /cultivar="W23"
                     /db_xref="taxon:4577"
                     /clone_lib="945 - Mixed adult tissues from Walbot lab,
                     same as 707 (SK)"
                     /tissue_type="tassel, kernal, silk, husk, root, leaf"
                     /dev_stage="fully-grown"
                     /lab_host="DH10B"
                     /note="Organ: tassel, kernal, silk, husk, root, leaf;
                     Vector: pGAD10; Site_1: EcoRI; cDNA library from fully
                     differentiated maize tissues from an active Mutator plant.
                     Tissue ratio is 4:2:1:1:1:1 (tassel, kernel, silk, husk,
                     root, leaf). Unidirectionally cloned. New library number
                     given to library 707 for additional sequencing."
BASE COUNT      109 a    130 c    151 g     87 t
ORIGIN



Query Match 86.0%; Score 362.2; DB 10; Length 477;
Best Local Similarity 96.2%; Pred. No. 1.2e-76;
Matches 381; Conservative 0; Mismatches 14; Indels 1; Gaps 1;

Qy         19 gcaggatcaggccaagctggctgcctcctctatcgacagcatcgtgaacggcagcgacgc 78
              ||||||||||||||||||||||||||||||||||||||||||||||||      ||||||
Db         83 GCAGGATCAGGCCAAGCTGGCTGCCTCCTCTATCGACAGCATCGTGAATCTTTTCGACGC 142

Qy         79 cgtcatggagccggttgttgctggcagcaacacggtagtagctgttgcccaagttgagtt 138
              |||||||||||||||||||||||||||||||||||||||||||| ||||||||| |||||
Db        143 CGTCATGGAGCCGGTTGTTGCTGGCAGCAACACGGTAGTAGCTGGTGCCCAAGTCGAGTT 202

Qy        139 gcaaacaatgaacgtgcagcagcccgctgatgttgccggacccagcgagggggtggcggc 198
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        203 GCAAACAATGAACGTGCAGCAGCCCGCTGATGTTGCCGGACCCAGCGAGGGGGTGGCGGC 262

Qy        199 gatctccaaaggggggaaggtanggccgaaccggtgcagcgcctgcaggaagagggttgg 258
              |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db        263 GATCTCCAAAGGGGGGAAGGTAGGGCCGAACCGGTGCAGCGCCTGCAGGAAGAGGGTTGG 322

Qy        259 acttacgggattcaactgccggtgtgggaacttgtactgcgcactccaccgctactccga 318
               || |||||||||||||||||||||||||||||||||||||| |||||||||||||||||
Db        323 GCTCACGGGATTCAACTGCCGGTGTGGGAACTTGTACTGCGCGCTCCACCGCTACTCCGA 382

Qy        319 caagcacgactgcaagttcgactaacggactgctgccaaggacgccattgccaagggcta 378
              |||||||||||||||||||||||| ||||||||||||| ||||||||||||||| |||||
Db        383 CAAGCACGACTGCAAGTTCGACTACCGGACTGCTGCCAGGGACGCCATTGCCAA-GGCTA 441

Qy        379 atccggtggtgaaggcagacaagctcgacaagatct 414
              ||||||||||||||||||||||||||||||||||||
Db        442 ATCCGGTGGTGAAGGCAGACAAGCTCGACAAGATCT 477



RESULT 6 

BE129631 

LOCUS       BE129631      518 bp    mRNA            EST       21-JUN-2000
DEFINITION  945027C01.X2 945 - Mixed adult tissues from Walbot lab, same as 707
            (SK) Zea mays cDNA, mRNA sequence.
ACCESSION   BE129631
VERSION     BE129631.1  GI:8576994
KEYWORDS    EST.
SOURCE      Zea mays.
  ORGANISM  Zea mays
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; PACC
            clade; Panicoideae; Andropogoneae; Zea.
REFERENCE   1  (bases 1 to 518)
  AUTHORS   Walbot,V.
  TITLE     Maize ESTs from various cDNA libraries sequenced at Stanford
            University
  JOURNAL   Unpublished (1999)
COMMENT     Contact: Walbot V
            Department of Biological Sciences
            Stanford University
            855 California Ave, Palo Alto, CA 94304, USA
            Tel: 650 723 2227
            Fax: 650 725 8221
            Email: walbot@stanford.edu
            Plate: 945027 row: C column: 01.
FEATURES             Location/Qualifiers
     source          1..518
                     /organism="Zea mays"
                     /cultivar="W23"
                     /db_xref="taxon:4577"
                     /clone_lib="945 - Mixed adult tissues from Walbot lab,
                     same as 707 (SK)"
                     /tissue_type="tassel, kernal, silk, husk, root, leaf"
                     /dev_stage="fully-grown"
                     /lab_host="DH10B"
                     /note="Organ: tassel, kernal, silk, husk, root, leaf;
                     Vector: pGAD10; Site_1: EcoRI; cDNA library from fully
                     differentiated maize tissues from an active Mutator plant.
                     Tissue ratio is 4:2:1:1:1:1 (tassel, kernel, silk, husk,
                     root, leaf). Unidirectionally cloned. New library number
                     given to library 707 for additional sequencing."
BASE COUNT      105 a    141 c    169 g    103 t
ORIGIN



Query Match 83.6%; Score 351.8; DB 10; Length 518; 

Best Local Similarity 97.6%; Pred. No. 3.6e-74; 

Matches 367; Conservative 0; Mismatches 8; Indels 1; Gaps 1; 

Qy  46 ctctatcgacagcatcgtgaacggcagcgacgccgtcatggagccggttgttgctggcag 105
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 CTCTATCGACAGCATCGTGAACGGCAGCGACGCCGTCATGGAGCCGGTTGTTGCTGGCAG 60

Qy 106 caacacggtagtagctgttgcccaagttgagttgcaaacaatgaacgtgcagcagcccgc 165
       ||||||||||||||||||||||||||| ||||||||||||||||||||||||||||||||
Db  61 CAACACGGTAGTAGCTGTTGCCCAAGTCGAGTTGCAAACAATGAACGTGCAGCAGCCCGC 120

Qy 166 tgatgttgccggacccagcgagggggtggcggcgatctccaaaggggggaaggtanggcc 225
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||| ||||
Db 121 TGATGTTGCCGGACCCAGCGAGGGGGTGGCGGCGATCTCCAAAGGGGGGAAGGTAGGGCC 180

Qy 226 gaaccggtgcagcgcctgcaggaagagggttggacttacgggattcaactgccggtgtgg 285
       ||||||||||||||||||||||||||||||||| || |||||||||||||||||||||||
Db 181 GAACCGGTGCAGCGCCTGCAGGAAGAGGGTTGGGCTCACGGGATTCAACTGCCGGTGTGG 240

Qy 286 gaacttgtactgcgcactccaccgctactccgacaagcacgactgcaagttcgactaacg 345
       ||||||||||||||| ||||||||||||||||||||||||||||||||||||||||| ||
Db 241 GAACTTGTACTGCGCGCTCCACCGCTACTCCGACAAGCACGACTGCAAGTTCGACTACCG 300

Qy 346 gactgctgccaaggacgccattgccaagggctaatccggtggtgaaggcagacaagctcg 405
       ||||||||||| ||||||||||||||| ||||||||||||||||||||||||||||||||
Db 301 GACTGCTGCCAGGGACGCCATTGCCAA-GGCTAATCCGGTGGTGAAGGCAGACAAGCTCG 359

Qy 406 acaagatctaaggggg 421
       |||||||||| |||||
Db 360 ACAAGATCTAGGGGGG 375



RESULT 7 

AI881804 

LOCUS       AI881804 594 bp mRNA EST 02-FEB-2000
DEFINITION  606073E02.y1 606 - Ear tissue cDNA library from Schmidt lab Zea
            mays cDNA, mRNA sequence.
ACCESSION   AI881804
VERSION     AI881804.1 GI:5566893
KEYWORDS    EST.
SOURCE      Zea mays.
  ORGANISM  Zea mays
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; PACC
            clade; Panicoideae; Andropogoneae; Zea.
REFERENCE   1 (bases 1 to 594)
  AUTHORS   Walbot,V.
  TITLE     Maize ESTs from various cDNA libraries sequenced at Stanford
            University
  JOURNAL   Unpublished (1999)
COMMENT     Contact: Walbot V
            Department of Biological Sciences
            Stanford University
            855 California Ave, Palo Alto, CA 94304, USA
            Tel: 650 723 2227
            Fax: 650 725 8221
            Email: walbot@stanford.edu
            Plate: 606073 row: E column: 02.
FEATURES             Location/Qualifiers
     source          1..594
                     /organism="Zea mays"
                     /cultivar="Ohio43"
                     /db_xref="taxon:4577"
                     /clone_lib="606 - Ear tissue cDNA library from Schmidt
                     lab"
                     /tissue_type="mixed"
                     /dev_stage="ear length from 0.5 cm - 2.0 cm"
                     /lab_host="XLOLR (Stratagene)"
                     /note="Organ: immature ear; Vector: pBK-CMV; Site_1: EcoRI;
                     Site_2: XhoI; Mixed ear tissue cDNA library from Schmidt
                     lab"

BASE COUNT 132 a 173 c 191 g 98 t 

ORIGIN 



Query Match 82.9%; Score 348.8; DB 10; Length 594;

Best Local Similarity 95.2%; Pred. No. 1.9e-73;

Matches 359; Conservative 0; Mismatches 18; Indels 0; Gaps 0;

Qy  19 gcaggatcaggccaagctggctgcctcctctatcgacagcatcgtgaacggcagcgacgc 78
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 215 GCAGGATCAGGCCAAGCTGGCTGCCTCCTCTATCGACAGCATCGTGAACGGCAGCGACGC 274

Qy  79 cgtcatggagccggttgttgctggcagcaacacggtagtagctgttgcccaagttgagtt 138
       |||||||||||||||||||||||||||||||||||||||||||||||||||||| |||||
Db 275 CGTCATGGAGCCGGTTGTTGCTGGCAGCAACACGGTAGTAGCTGTTGCCCAAGTCGAGTT 334

Qy 139 gcaaacaatgaacgtgcagcagcccgctgatgttgccggacccagcgagggggtggcggc 198
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 335 GCAAACAATGAACGTGCAGCAGCCCGCTGATGTTGCCGGACCCAGCGAGGGGGTGGCGGC 394

Qy 199 gatctccaaaggggggaaggtanggccgaaccggtgcagcgcctgcaggaagagggttgg 258
       |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db 395 GATCTCCAAAGGGGGGAAGGTAGGGCCGAACCGGTGCAGCGCCTGCAGGAAGAGGGTTGG 454

Qy 259 acttacgggattcaactgccggtgtgggaacttgtactgcgcactccaccgctactccga 318
        || |||||||||||||||||||||||||||||||||||||| |||||||||||||||||
Db 455 GCTCACGGGATTCAACTGCCGGTGTGGGAACTTGTACTGCGCGCTCCACCGCTACTCCGA 514

Qy 319 caagcacgactgcaagttcgactaacggactgctgccaaggacgccattgccaagggcta 378
       |||||||||||||||||||||||| ||||||||||||| |||||||||||||||||   |
Db 515 CAAGCACGACTGCAAGTTCGACTACCGGACTGCTGCCAGGGACGCCATTGCCAAGGCTAA 574

Qy 379 atccggtggtgaaggca 395
         | || |  | || ||
Db 575 TCCGGGGGAAGCAGACA 591



RESULT 8 

BG355155 

LOCUS       BG355155 488 bp mRNA EST 06-MAR-2001
DEFINITION  947043D10.y1 947 - 2 week shoot from Barkan lab Zea mays cDNA, mRNA
            sequence.
ACCESSION   BG355155
VERSION     BG355155.1 GI:13237141
KEYWORDS    EST.
SOURCE      Zea mays.
  ORGANISM  Zea mays
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; PACC
            clade; Panicoideae; Andropogoneae; Zea.
REFERENCE   1 (bases 1 to 488)
  AUTHORS   Walbot,V.
  TITLE     Maize ESTs from various cDNA libraries sequenced at Stanford
            University
  JOURNAL   Unpublished (1999)
COMMENT     Contact: Walbot V
            Department of Biological Sciences
            Stanford University
            855 California Ave, Palo Alto, CA 94304, USA
            Tel: 650 723 2227
            Fax: 650 725 8221
            Email: walbot@stanford.edu
            Plate: 947043 row: D column: 10.
FEATURES             Location/Qualifiers
     source          1..488
                     /organism="Zea mays"
                     /cultivar="B73"
                     /db_xref="taxon:4577"
                     /clone_lib="947 - 2 week shoot from Barkan lab"
                     /tissue_type="leaf and stem, including leaf base"
                     /dev_stage="2 week old seedling (3 leaves)"
                     /lab_host="XL1-Blue"
                     /note="Organ: shoot; Vector: Lambda ZAP (pBlueScript SK-);
                     Site_1: EcoRI; Site_2: XhoI; Directionally cloned using
                     Stratagene's UniZap XR cDNA cloning kit with the 5' end
                     at the EcoRI site. The library represents 8 x 10e5
                     independent recombinant phage. The plants were greenhouse
                     grown."
BASE COUNT  111 a 136 c 156 g 85 t
ORIGIN



Query Match 78.7%; Score 331.4; DB 11; Length 488;

Best Local Similarity 99.4%; Pred. No. 2.7e-69;

Matches 332; Conservative 0; Mismatches 2; Indels 0; Gaps 0;

Qy  19 gcaggatcaggccaagctggctgcctcctctatcgacagcatcgtgaacggcagcgacgc 78
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 155 GCAGGATCAGGCCAAGCTGGCTGCCTCCTCTATCGACAGCATCGTGAACGGCAGCGACGC 214

Qy  79 cgtcatggagccggttgttgctggcagcaacacggtagtagctgttgcccaagttgagtt 138
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 215 CGTCATGGAGCCGGTTGTTGCTGGCAGCAACACGGTAGTAGCTGTTGCCCAAGTTGAGTT 274

Qy 139 gcaaacaatgaacgtgcagcagcccgctgatgttgccggacccagcgagggggtggcggc 198
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 275 GCAAACAATGAACGTGCAGCAGCCCGCTGATGTTGCCGGACCCAGCGAGGGGGTGGCGGC 334

Qy 199 gatctccaaaggggggaaggtanggccgaaccggtgcagcgcctgcaggaagagggttgg 258
       |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db 335 GATCTCCAAAGGGGGGAAGGTAGGGCCGAACCGGTGCAGCGCCTGCAGGAAGAGGGTTGG 394

Qy 259 acttacgggattcaactgccggtgtgggaacttgtactgcgcactccaccgctactccga 318
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 395 ACTTACGGGATTCAACTGCCGGTGTGGGAACTTGTACTGCGCACTCCACCGCTACTCCGA 454

Qy 319 caagcacgactgcaagttcgactaacggactgct 352
       |||||||||||||||||||||||| |||||||||
Db 455 CAAGCACGACTGCAAGTTCGACTACCGGACTGCT 488



RESULT 9 
BE510926/C 
LOCUS       BE510926 482 bp mRNA EST 07-AUG-2000
DEFINITION  946056E05.x1 946 - tassel primordium prepared by Schmidt lab Zea
            mays cDNA, mRNA sequence.
ACCESSION   BE510926
VERSION     BE510926.1 GI:9732174
KEYWORDS    EST.
SOURCE      Zea mays.
  ORGANISM  Zea mays
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; PACC
            clade; Panicoideae; Andropogoneae; Zea.
REFERENCE   1 (bases 1 to 482)
  AUTHORS   Walbot,V.
  TITLE     Maize ESTs from various cDNA libraries sequenced at Stanford
            University
  JOURNAL   Unpublished (1999)
COMMENT     Contact: Walbot V
            Department of Biological Sciences
            Stanford University
            855 California Ave, Palo Alto, CA 94304, USA
            Tel: 650 723 2227
            Fax: 650 725 8221
            Email: walbot@stanford.edu
            Plate: 946056 row: E column: 05.
FEATURES             Location/Qualifiers
     source          1..482
                     /organism="Zea mays"
                     /cultivar="OH43"
                     /db_xref="taxon:4577"
                     /clone_lib="946 - tassel primordium prepared by Schmidt
                     lab"
                     /tissue_type="tassels"
                     /dev_stage="just after the transition from vegetative to
                     inflorescence development"
                     /lab_host="XLOLR"
                     /note="Organ: tassels; Vector: HybriZAP; Site_1: EcoRI;
                     Site_2: XhoI; George Chuck dissected immature tassels
                     between 1mm and 3mm. Sharon Stanfield prepared the cDNA
                     library in HybriZAP. Sample insert size range was 350 bp
                     to 3 Kb with a 1 Kb average."

BASE COUNT 98 a 158 c 131 g 95 t 

ORIGIN 



Query Match 77.1%; Score 324.8; DB 10; Length 482; 

Best Local Similarity 97.4%; Pred. No. 1e-67;

Matches 340; Conservative 0; Mismatches 8; Indels 1; Gaps 1; 

Qy  73 cgacgccgtcatggagccggttgttgctggcagcaacacggtagtagctgttgcccaagt 132
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 482 CGACGCCGTCATGGAGCCGGTTGTTGCTGGCAGCAACACGGTAGTAGCTGTTGCCCAAGT 423

Qy 133 tgagttgcaaacaatgaacgtgcagcagcccgctgatgttgccggacccagcgagggggt 192
        |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 422 CGAGTTGCAAACAATGAACGTGCAGCAGCCCGCTGATGTTGCCGGACCCAGCGAGGGGGT 363

Qy 193 ggcggcgatctccaaaggggggaaggtanggccgaaccggtgcagcgcctgcaggaagag 252
       |||||||||||||||||||||||||||| |||||||||||||||||||||||||||||||
Db 362 GGCGGCGATCTCCAAAGGGGGGAAGGTAGGGCCGAACCGGTGCAGCGCCTGCAGGAAGAG 303

Qy 253 ggttggacttacgggattcaactgccggtgtgggaacttgtactgcgcactccaccgcta 312
       |||||| || |||||||||||||||||||||||||||||||||||||| |||||||||||
Db 302 GGTTGGGCTCACGGGATTCAACTGCCGGTGTGGGAACTTGTACTGCGCGCTCCACCGCTA 243

Qy 313 ctccgacaagcacgactgcaagttcgactaacggactgctgccaaggacgccattgccaa 372
       |||||||||||||||||||||||||||||| ||||||||||||| |||||||||||||||
Db 242 CTCCGACAAGCACGACTGCAAGTTCGACTACCGGACTGCTGCCAGGGACGCCATTGCCAA 183

Qy 373 gggctaatccggtggtgaaggcagacaagctcgacaagatctaaggggg 421
        |||||||||||||||||||||||||||||||||||||||||| |||||
Db 182 -GGCTAATCCGGTGGTGAAGGCAGACAAGCTCGACAAGATCTAGGGGGG 135



RESULT 10 

AW927909 

LOCUS       AW927909 567 bp mRNA EST 30-MAY-2000
DEFINITION  945010H04.Y1 945 - Mixed adult tissues from Walbot lab, same as 707
            (SK) Zea mays cDNA, mRNA sequence.
ACCESSION   AW927909
VERSION     AW927909.1 GI:8103259
KEYWORDS    EST.
SOURCE      Zea mays.
  ORGANISM  Zea mays
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; PACC
            clade; Panicoideae; Andropogoneae; Zea.
REFERENCE   1 (bases 1 to 567)
  AUTHORS   Walbot,V.
  TITLE     Maize ESTs from various cDNA libraries sequenced at Stanford
            University
  JOURNAL   Unpublished (1999)
COMMENT     Contact: Walbot V
            Department of Biological Sciences
            Stanford University
            855 California Ave, Palo Alto, CA 94304, USA
            Tel: 650 723 2227
            Fax: 650 725 8221
            Email: walbot@stanford.edu
            Plate: 945010 row: H column: 04.
FEATURES             Location/Qualifiers
     source          1..567
                     /organism="Zea mays"
                     /cultivar="W23"
                     /db_xref="taxon:4577"
                     /clone_lib="945 - Mixed adult tissues from Walbot lab,
                     same as 707 (SK)"
                     /tissue_type="tassel, kernal, silk, husk, root, leaf"
                     /dev_stage="fully-grown"
                     /lab_host="DH10B"
                     /note="Organ: tassel, kernal, silk, husk, root, leaf;
                     Vector: pGAD10; Site_1: EcoRI; cDNA library from fully
                     differentiated maize tissues from an active Mutator plant.
                     Tissue ratio is 4:2:1:1:1:1 (tassel, kernel, silk, husk,
                     root, leaf). Unidirectionally cloned. New library number
                     given to library 707 for additional sequencing."
BASE COUNT  113 a 147 c 177 g 130 t
ORIGIN



Query Match 74.3%; Score 312.8; DB 10; Length 567;

Best Local Similarity 97.3%; Pred. No. 7.4e-65;

Matches 328; Conservative 0; Mismatches 8; Indels 1; Gaps 1;

Qy  85 ggagccggttgttgctggcagcaacacggtagtagctgttgcccaagttgagttgcaaac 144
       |||||||||||||||||||||||||||||||||||||||||||||||| |||||||||||
Db   1 GGAGCCGGTTGTTGCTGGCAGCAACACGGTAGTAGCTGTTGCCCAAGTCGAGTTGCAAAC 60

Qy 145 aatgaacgtgcagcagcccgctgatgttgccggacccagcgagggggtggcggcgatctc 204
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 AATGAACGTGCAGCAGCCCGCTGATGTTGCCGGACCCAGCGAGGGGGTGGCGGCGATCTC 120

Qy 205 caaaggggggaaggtanggccgaaccggtgcagcgcctgcaggaagagggttggacttac 264
       |||||||||||||||| ||||||||||||||||||||||||||||||||||||| || ||
Db 121 CAAAGGGGGGAAGGTAGGGCCGAACCGGTGCAGCGCCTGCAGGAAGAGGGTTGGGCTCAC 180

Qy 265 gggattcaactgccggtgtgggaacttgtactgcgcactccaccgctactccgacaagca 324
       |||||||||||||||||||||||||||||||||||| |||||||||||||||||||||||
Db 181 GGGATTCAACTGCCGGTGTGGGAACTTGTACTGCGCGCTCCACCGCTACTCCGACAAGCA 240

Qy 325 cgactgcaagttcgactaacggactgctgccaaggacgccattgccaagggctaatccgg 384
       |||||||||||||||||| ||||||||||||| ||||||||||||||| |||||||||||
Db 241 CGACTGCAAGTTCGACTACCGGACTGCTGCCAGGGACGCCATTGCCAA-GGCTAATCCGG 299

Qy 385 tggtgaaggcagacaagctcgacaagatctaaggggg 421
       ||||||||||||||||||||||||||||||| |||||
Db 300 TGGTGAAGGCAGACAAGCTCGACAAGATCTAGGGGGG 336



RESULT 11 
AI977903/C 
LOCUS       AI977903 643 bp mRNA EST 27-AUG-1999
DEFINITION  496035D05.x1 496 - stressed shoot cDNA library from Wang/Bohnert
            lab Zea mays cDNA, mRNA sequence.
ACCESSION   AI977903
VERSION     AI977903.1 GI:5791111
KEYWORDS    EST.
SOURCE      Zea mays.
  ORGANISM  Zea mays
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; PACC
            clade; Panicoideae; Andropogoneae; Zea.
REFERENCE   1 (bases 1 to 643)
  AUTHORS   Walbot,V.
  TITLE     Maize ESTs from various cDNA libraries sequenced at Stanford
            University
  JOURNAL   Unpublished (1999)
COMMENT     Contact: Walbot V
            Department of Biological Sciences
            Stanford University
            855 California Ave, Palo Alto, CA 94304, USA
            Tel: 650 723 2227
            Fax: 650 725 8221
            Email: walbot@stanford.edu
            Plate: 496035 row: D column: 05.
FEATURES             Location/Qualifiers
     source          1..643
                     /organism="Zea mays"
                     /cultivar="B73"
                     /db_xref="taxon:4577"
                     /clone_lib="496 - stressed shoot cDNA library from
                     Wang/Bohnert lab"
                     /tissue_type="seedling"
                     /dev_stage="salt stress"
                     /lab_host="E.coli XL Gold"
                     /note="Organ: shoot; Vector: pBluescriptII SK(+) XR;
                     Wang/Bohnert"
BASE COUNT  164 a 192 c 159 g 128 t
ORIGIN



Query Match 69.0%; Score 290.6; DB 10; Length 643; 

Best Local Similarity 98.1%; Pred. No. 1.5e-59; 

Matches 304; Conservative 0; Mismatches 5; Indels 1; Gaps 1;

Qy 112 ggtagtagctgttgcccaagttgagttgcaaacaatgaacgtgcagcagcccgctgatgt 171
       ||||||||||||||||||||||||||||||| ||||||||||||||||||||||||||||
Db 643 GGTAGTAGCTGTTGCCCAAGTTGAGTTGCAACCAATGAACGTGCAGCAGCCCGCTGATGT 584

Qy 172 tgccggacccagcgagggggtggcggcgatctccaaaggggggaaggtanggccgaaccg 231
       ||||||||||||||||||||||||||||||||||||||||||||||||| ||||||||||
Db 583 TGCCGGACCCAGCGAGGGGGTGGCGGCGATCTCCAAAGGGGGGAAGGTAGGGCCGAACCG 524

Qy 232 gtgcagcgcctgcaggaagagggttggacttacgggattcaactgccggtgtgggaactt 291
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 523 GTGCAGCGCCTGCAGGAAGAGGGTTGGACTTACGGGATTCAACTGCCGGTGTGGGAACTT 464

Qy 292 gtactgcgcactccaccgctactccgacaagcacgactgcaagttcgactaacggactgc 351
       ||||||||||||||||||||||||||||||||||||||||||||||||||| ||||||||
Db 463 GTACTGCGCACTCCACCGCTACTCCGACAAGCACGACTGCAAGTTCGACTACCGGACTGC 404

Qy 352 tgccaaggacgccattgccaagggctaatccggtggtgaaggcagacaagctcgacaaga 411
       ||||| ||||||||||||||| ||||||||||||||||||||||||||||||||||||||
Db 403 TGCCAGGGACGCCATTGCCAA-GGCTAATCCGGTGGTGAAGGCAGACAAGCTCGACAAGA 345

Qy 412 tctaaggggg 421
       |||| |||||
Db 344 TCTAGGGGGG 335



RESULT 12 
AW061851/C 
LOCUS       AW061851 514 bp mRNA EST 30-MAR-2000
DEFINITION  687007F12.x1 687 - Early embryo from Delaware Zea mays cDNA, mRNA
            sequence.
ACCESSION   AW061851
VERSION     AW061851.1 GI:6012414
KEYWORDS    EST.
SOURCE      Zea mays.
  ORGANISM  Zea mays
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; PACC
            clade; Panicoideae; Andropogoneae; Zea.
REFERENCE   1 (bases 1 to 514)
  AUTHORS   Walbot,V.
  TITLE     Maize ESTs from various cDNA libraries sequenced at Stanford
            University
  JOURNAL   Unpublished (1999)
COMMENT     Contact: Walbot V
            Department of Biological Sciences
            Stanford University
            855 California Ave, Palo Alto, CA 94304, USA
            Tel: 650 723 2227
            Fax: 650 725 8221
            Email: walbot@stanford.edu
            Plate: 687007 row: F column: 12.
FEATURES             Location/Qualifiers
     source          1..514
                     /organism="Zea mays"
                     /cultivar="Illinois High Oil"
                     /db_xref="taxon:4577"
                     /clone_lib="687 - Early embryo from Delaware"
                     /tissue_type="embryo"
                     /dev_stage="14, 21, 28, and 35 days after pollination"
                     /lab_host="E. coli SOLR"
                     /note="Organ: embryo; Vector: pBluescript SK; Site_1: XhoI;
                     Site_2: EcoRI; Library was prepared by Stratagene using
                     the Uni-ZAP XR system (Stratagene BN937328-12). Clones
                     were picked by a Q-bot after blue/white selection
                     (ampicillin resistance - use 100 micrograms/microliter).
                     Developed from a pool of equal amounts of RNA from
                     developing embryos sampled at 14, 21, 28 and 35 days after
                     pollination of the Illinois High Oil Maize Strain Cycle
                     90. This closed strain has been selected for high oil
                     concentration for 90 generations and originates from the
                     1890s era open pollinated variety Burr's White"

BASE COUNT 116 a 156 c 138 g 104 t 

ORIGIN 

Query Match 68.9%; Score 290; DB 10; Length 514; 

Best Local Similarity 97.7%; Pred. No. 2e-59; 

Matches 304; Conservative 0; Mismatches 6; Indels 1; Gaps 1; 

Qy 111 cggtagtagctgttgcccaagttgagttgcaaacaatgaacgtgcagcagcccgctgatg 170
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 513 CGGTAGTAGCTGTTGCCCAAGTTGAGTTGCAAACAATGAACGTGCAGCAGCCCGCTGATG 454

Qy 171 ttgccggacccagcgagggggtggcggcgatctccaaaggggggaaggtanggccgaacc 230
       |||||||||||||||||||||||||||||||||||||||||||||||||| | |||||||
Db 453 TTGCCGGACCCAGCGAGGGGGTGGCGGCGATCTCCAAAGGGGGGAAGGTAGGCCCGAACC 394

Qy 231 ggtgcagcgcctgcaggaagagggttggacttacgggattcaactgccggtgtgggaact 290
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 393 GGTGCAGCGCCTGCAGGAAGAGGGTTGGACTTACGGGATTCAACTGCCGGTGTGGGAACT 334

Qy 291 tgtactgcgcactccaccgctactccgacaagcacgactgcaagttcgactaacggactg 350
       |||||||||||||||||||||||||||||||||||||||||||||||||||| ||| |||
Db 333 TGTACTGCGCACTCCACCGCTACTCCGACAAGCACGACTGCAAGTTCGACTACCGGTCTG 274

Qy 351 ctgccaaggacgccattgccaagggctaatccggtggtgaaggcagacaagctcgacaag 410
       |||||| ||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db 273 CTGCCAGGGACGCCATTGCCAA-GGCTAATCCGGTGGTGAAGGCAGACAAGCTCGACAAG 215

Qy 411 atctaaggggg 421
       ||||| |||||
Db 214 ATCTAGGGGGG 204



RESULT 13 

AI649645 

LOCUS       AI649645 527 bp mRNA EST 30-APR-1999
DEFINITION  486077B07.y1 486 - leaf primordia cDNA library from Hake lab Zea
            mays cDNA, mRNA sequence.
ACCESSION   AI649645
VERSION     AI649645.1 GI:4730479
KEYWORDS    EST.
SOURCE      Zea mays.
  ORGANISM  Zea mays
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; PACC
            clade; Panicoideae; Andropogoneae; Zea.
REFERENCE   1 (bases 1 to 527)
  AUTHORS   Walbot,V.
  TITLE     Maize ESTs from various cDNA libraries sequenced at Stanford
            University
  JOURNAL   Unpublished (1999)
COMMENT     Contact: Walbot V
            Department of Biological Sciences
            Stanford University
            855 California Ave, Palo Alto, CA 94304, USA
            Tel: 650 723 2227
            Fax: 650 725 8221
            Email: walbot@stanford.edu
            Plate: 486077 row: B column: 07.
FEATURES             Location/Qualifiers
     source          1..527
                     /organism="Zea mays"
                     /cultivar="B73"
                     /db_xref="taxon:4577"
                     /clone_lib="486 - leaf primordia cDNA library from Hake
                     lab"
                     /tissue_type="leaf primordia"
                     /dev_stage="P7-P11 leaf"
                     /lab_host="E.coli XL1-Blue MFR'"
                     /note="Organ: shoot; Vector: Lambda zap; Hake lab cDNA
                     library."
BASE COUNT  117 a 151 c 169 g 90 t
ORIGIN



Query Match 68.3%; Score 287.4; DB 10; Length 527;

Best Local Similarity 99.0%; Pred. No. 8.5e-59;

Matches 299; Conservative 0; Mismatches 2; Indels 1; Gaps 1;

Qy  19 gcaggatcaggccaagctggctgcctcctctatcgacagcatcgtgaacggcagcgacgc 78
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 226 GCAGGATCAGGCCAAGCTGGCTGCCTCCTCTATCGACAGCATCGTGAACGGCAGCGACGC 285

Qy  79 cgtcatggagccggttgttgctggcagcaacacggtagtagctgttgcccaagttgagtt 138
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 286 CGTCATGGAGCCGGTTGTTGCTGGCAGCAACACGGTAGTAGCTGTTGCCCAAGTTGAGTT 345

Qy 139 gcaaacaatgaacgtgcagcagcccgctgatgttgccggacccagcgagggggtggcggc 198
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 346 GCAAACAATGAACGTGCAGCAGCCCGCTGATGTTGCCGGACCCAGCGAGGGGGTGGCGGC 405

Qy 199 gatctccaaaggggggaaggtanggccgaaccggtgcagcgcctgcaggaagagggttgg 258
       |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db 406 GATCTCCAAAGGGGGGAAGGTAGGGCCGAACCGGTGCAGCGCCTGCAGGAAGAGGGTTGG 465

Qy 259 acttacgggattcaactgccggtgtgggaacttgtactgcgcactccaccgctactccga 318
       |||||||||||||||||||||| | |||||||||||||||||||||||||||||||||||
Db 466 ACTTACGGGATTCAACTGCCGG-GGGGGAACTTGTACTGCGCACTCCACCGCTACTCCGA 524

Qy 319 ca 320
       ||
Db 525 CA 526



RESULT 14 
BE129734/C 

LOCUS       BE129734 599 bp mRNA EST 21-JUN-2000
DEFINITION  946003G07.X1 946 - tassel primordium prepared by Schmidt lab Zea
            mays cDNA, mRNA sequence.
ACCESSION   BE129734
VERSION     BE129734.1 GI:8577097
KEYWORDS    EST.
SOURCE      Zea mays.
  ORGANISM  Zea mays
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; PACC
            clade; Panicoideae; Andropogoneae; Zea.
REFERENCE   1 (bases 1 to 599)
  AUTHORS   Walbot,V.
  TITLE     Maize ESTs from various cDNA libraries sequenced at Stanford
            University
  JOURNAL   Unpublished (1999)
COMMENT     Contact: Walbot V
            Department of Biological Sciences
            Stanford University
            855 California Ave, Palo Alto, CA 94304, USA
            Tel: 650 723 2227
            Fax: 650 725 8221
            Email: walbot@stanford.edu
            Plate: 946003 row: G column: 07.
FEATURES             Location/Qualifiers
     source          1..599
                     /organism="Zea mays"
                     /cultivar="OH43"
                     /db_xref="taxon:4577"
                     /clone_lib="946 - tassel primordium prepared by Schmidt
                     lab"
                     /tissue_type="tassels"
                     /dev_stage="just after the transition from vegetative to
                     inflorescence development"
                     /lab_host="XLOLR"
                     /note="Organ: tassels; Vector: HybriZAP; Site_1: EcoRI;
                     Site_2: XhoI; George Chuck dissected immature tassels
                     between 1mm and 3mm. Sharon Stanfield prepared the cDNA
                     library in HybriZAP. Sample insert size range was 350 bp
                     to 3 Kb with a 1 Kb average."

BASE COUNT 149 a 182 c 152 g 116 t 

ORIGIN 



Query Match 65.7%; Score 276.8; DB 10; Length 599;

Best Local Similarity 97.0%; Pred. No. 2.9e-56;

Matches 292; Conservative 0; Mismatches 8; Indels 1; Gaps 1;

Qy 121 tgttgcccaagttgagttgcaaacaatgaacgtgcagcagcccgctgatgttgccggacc 180
       |||||||||||| |||||||||||||||||||||||||||||||||||||||||||||||
Db 599 TGTTGCCCAAGTCGAGTTGCAAACAATGAACGTGCAGCAGCCCGCTGATGTTGCCGGACC 540

Qy 181 cagcgagggggtggcggcgatctccaaaggggggaaggtanggccgaaccggtgcagcgc 240
       |||||||||||||||||||||||||||||||||||||||| |||||||||||||||||||
Db 539 CAGCGAGGGGGTGGCGGCGATCTCCAAAGGGGGGAAGGTAGGGCCGAACCGGTGCAGCGC 480

Qy 241 ctgcaggaagagggttggacttacgggattcaactgccggtgtgggaacttgtactgcgc 300
       |||||||||||||||||| || ||||||||||||||||||||||||||||||||||||||
Db 479 CTGCAGGAAGAGGGTTGGGCTCACGGGATTCAACTGCCGGTGTGGGAACTTGTACTGCGC 420

Qy 301 actccaccgctactccgacaagcacgactgcaagttcgactaacggactgctgccaagga 360
        ||||||||||||||||||||||||||||||||||||||||| ||||||||||||| |||
Db 419 GCTCCACCGCTACTCCGACAAGCACGACTGCAAGTTCGACTACCGGACTGCTGCCAGGGA 360

Qy 361 cgccattgccaagggctaatccggtggtgaaggcagacaagctcgacaagatctaagggg 420
       |||||||||||| |||||||||||||||||||||||||||||||||||||||||| ||||
Db 359 CGCCATTGCCAA-GGCTAATCCGGTGGTGAAGGCAGACAAGCTCGACAAGATCTAGGGGG 301

Qy 421 g 421
       |
Db 300 G 300



RESULT 15 
W99253 

LOCUS       W99253 660 bp mRNA EST 17-JUL-1996
DEFINITION  EST01 Glucose-starved maize root tip cDNA library Zea mays cDNA
            clone MZ2B1 3' similar to Phaseolus vulgaris PVPR3, mRNA sequence
ACCESSION   W99253
VERSION     W99253.1 GI:1432180
KEYWORDS    EST.
SOURCE      Zea mays.
  ORGANISM  Zea mays
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; PACC
            clade; Panicoideae; Andropogoneae; Zea.
REFERENCE   1 (bases 1 to 660)
  AUTHORS   Couee,I. and Chevalier,C.
  TITLE     Maize root cDNAs
  JOURNAL   Unpublished (1996)
COMMENT     Contact: Couee I
            Station d'amelioration des arbres forestiers
            Institut National de la Recherche Agronomique
            F-45160 Ardon, France
            Fax: 33-38417879
            Email: couee@orleans.inra.fr
            Similarity determined by BLASTX.
FEATURES             Location/Qualifiers
     source          1..660
                     /organism="Zea mays"
                     /cultivar="DEA"
                     /db_xref="taxon:4577"
                     /clone="MZ2B1"
                     /clone_lib="Glucose-starved maize root tip cDNA library"
                     /tissue_type="3-mm-long tip"
                     /note="Organ: Primary root; Vector: Lambda ZAPII; cDNA
                     library from poly(A)+ mRNA of 24-h glucose-starved maize
                     root tips, as described in Chevalier, C., Bourgeois, E.,
                     Pradet, A., Raymond, P. (1995) Plant Molecular Biology
                     Volume 28 Pages 473-485"

BASE COUNT 142 a 165 c 196 g 154 t 3 others 

ORIGIN 



Query Match 62.8%; Score 264.2; DB 11; Length 660;

Best Local Similarity 95.1%; Pred. No. 3e-53;

Matches 292; Conservative 0; Mismatches 12; Indels 3; Gaps

Qy 113 gtagtagctgttgcccaagttgagttgcaaacaatgaacgtgcagcagcccgctgatgtt 172
Db     [sequence illegible in source]

Qy 173 gccggacccagcgaggggg--tggcggcgatctccaaaggggggaaggtanggccgaacc 230
Db     [sequence illegible in source]

Qy 231 ggtgcagcgcctgcaggaagagggttggacttacgggattcaactgccggtgtgggaact 290
Db 121 GGTGCAGCGCCTGCAGGAAGA[remainder illegible in source] 180

Qy 291 tgtactgcgcactccaccgctactccgacaagcacgactgcaagttcgactaacggactg 350
       |||||||| | ||||||||||||||||||||||||||||||||||||||||| |||||||
Db 181 TGTACTGCNCCCTCCACCGCTACTCCGACAAGCACGACTGCAAGTTCGACTACCGGACTG 240

Qy 351 ctgccaaggacgccattgccaagggctaatccggtggtgaaggcagacaagctcgacaag 410
       |||||| |||  |||||||||| |||||||||||||||||||||||||||||||||||||
Db 241 CTGCCAGGGATNCCATTGCCAA-GGCTAATCCGGTGGTGAAGGCAGACAAGCTCGACAAG 299

Qy 411 atctaag 417
       ||||| |
Db 300 ATCTAGG 306





Search completed: February 7, 2002, 08:20:51 
Job time: 18128 sec 
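
The per-hit summary figures in this listing appear to be related by simple arithmetic: "Best Local Similarity" matches the ratio of Matches to all aligned columns (Matches + Mismatches + Indels), and "Query Match" matches the hit Score divided by the perfect score. The Python sketch below checks that reading against three of the results above. It is an inference from the reported numbers, not a documented definition from the search program; the function names are hypothetical, and the perfect score of 421 is assumed from the report header.

```python
def best_local_similarity(matches: int, mismatches: int, indels: int) -> float:
    """Percent of aligned columns that are exact matches (assumed definition)."""
    aligned_columns = matches + mismatches + indels
    return 100.0 * matches / aligned_columns


def query_match(score: float, perfect_score: float) -> float:
    """Hit score as a percentage of the query's perfect score (assumed definition)."""
    return 100.0 * score / perfect_score


# Assumption: the perfect score equals the query length shown in the header.
PERFECT_SCORE = 421

# (matches, mismatches, indels, score) copied from the result summaries.
hits = {
    "RESULT 6 (BE129631)": (367, 8, 1, 351.8),
    "RESULT 7 (AI881804)": (359, 18, 0, 348.8),
    "RESULT 8 (BG355155)": (332, 2, 0, 331.4),
}

for name, (matches, mismatches, indels, score) in hits.items():
    similarity = best_local_similarity(matches, mismatches, indels)
    match_pct = query_match(score, PERFECT_SCORE)
    print(f"{name}: similarity {similarity:.1f}%, query match {match_pct:.1f}%")
```

Rounded to one decimal place, the computed values agree with the percentages printed for these three hits, which is the only evidence offered here for the assumed definitions.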



